Scholar Data

Datasets

A digital forensics corpus representing the view of academics and practitioners 1999-2021

A significant challenge in digital forensics is the lack of a framework for common language and knowledge. This creates barriers to communication, collaboration and knowledge sharing amongst stakeholders. Methods for creating a comprehensive set of common terms on a topic include Natural Language Processing (NLP) and Generative Artificial Intelligence (GenAI) algorithms. The efficiency of these algorithms depends on the coverage, quality and quantity of the training corpus. As far as we know, no such corpus is readily available for training these algorithms. This dataset is a digital forensics practice and research corpus, validated by practitioners working in this domain. The corpus is ready for training new generations of NLP and GenAI algorithms. The associated paper also presents a systematic method of sharing a training corpus, in which the data structure, such as folder and file names, makes it convenient to interact with the data programmatically.
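As a rough illustration of what programmatic interaction through folder and file names could look like, the following minimal Python sketch groups text files by the top-level folder they sit in. The actual layout of the corpus is not described on this page, so the function name `index_corpus`, the `*.txt` file pattern and the idea of grouping by top-level folder are all assumptions made for illustration, not the authors' published structure.

```python
from collections import defaultdict
from pathlib import Path


def index_corpus(root: str, pattern: str = "*.txt") -> dict[str, list[Path]]:
    """Group files under `root` by the name of their top-level folder.

    Hypothetical helper: the folder layout and the "*.txt" pattern are
    assumptions, not the documented structure of the corpus.
    """
    root_path = Path(root)
    index: dict[str, list[Path]] = defaultdict(list)
    # rglob walks the whole tree; sorting keeps the result deterministic.
    for path in sorted(root_path.rglob(pattern)):
        if not path.is_file():
            continue  # skip directories whose names happen to match
        relative = path.relative_to(root_path)
        # Files directly in the root have no folder; group them under ".".
        group = relative.parts[0] if len(relative.parts) > 1 else "."
        index[group].append(path)
    return dict(index)
```

Calling `index_corpus("path/to/corpus")` would return a dictionary that maps each top-level folder name to the list of matching files beneath it, which could then be fed to a tokenizer or a training pipeline. The path shown is a placeholder.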

Authors

Santo, Farhan Tanvir ;
Puch-Solis, Roberto ;
Le Gall, Maël ;
Cole, Christian ;
Haynes, David ;
NicDaeid, Niamh

0 Citations | 0 Mentions | 35% FAIR | 0.2 Dataset Index

DOI: 10.15132/10000252 | January 2024

Automated Author Profile
Haynes, David
Edinburgh Napier University
0000-0001-9191-9247
