Automated Author Profile

Haynes, David

Edinburgh Napier University
0000-0001-9191-9247

Current S-Index

0.2

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

0.2

Average Dataset Index per dataset

Total Datasets

1

Total datasets for this author

Average FAIR Score

34.6%

Average FAIR Score per dataset

Total Citations

0

Total citations to the author's datasets

Total Mentions

0

Total mentions of the author's datasets
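
The summary figures above follow from the per-dataset values: the S-Index is described as the sum of Dataset Indices, and the averages divide by the number of datasets. The following is a minimal sketch of that arithmetic, assuming a hypothetical `Dataset` record and `summarize` helper. Neither name comes from the profile's source system, and how an individual Dataset Index is derived is not stated here.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical record; field names are illustrative assumptions.
    dataset_index: float
    fair_score: float  # FAIR score as a percentage
    citations: int
    mentions: int


def summarize(datasets):
    """Aggregate per-dataset values into the profile-level figures."""
    n = len(datasets)
    s_index = sum(d.dataset_index for d in datasets)
    return {
        "s_index": s_index,  # sum of Dataset Indices
        "avg_dataset_index": s_index / n if n else 0.0,
        "total_datasets": n,
        "avg_fair_score": sum(d.fair_score for d in datasets) / n if n else 0.0,
        "total_citations": sum(d.citations for d in datasets),
        "total_mentions": sum(d.mentions for d in datasets),
    }


# The single dataset listed in this profile, using the values shown above.
profile = summarize([Dataset(dataset_index=0.2, fair_score=34.6, citations=0, mentions=0)])
```

With only one dataset, the S-Index and the average Dataset Index coincide, which is why both read 0.2 in this profile.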

S-Index Interpretation

[Charts: S-Index Over Time; Cumulative Citations Over Time; Cumulative Mentions Over Time]

Datasets

A digital forensics corpus representing the view of academics and practitioners 1999-2021

A significant challenge in digital forensics is the lack of a framework for common language and knowledge. This creates barriers to communication, collaboration and knowledge sharing amongst stakeholders. Methods for creating a comprehensive set of common terms on a topic include Natural Language Processing (NLP) and Generative Artificial Intelligence (GenAI) algorithms. The efficiency of these algorithms depends on the coverage, quality and quantity of the training corpus. As far as we know, no such corpus is readily available for training these algorithms. This is a digital forensics practice and research corpus, validated by practitioners working in this domain. The corpus is ready for training new generations of NLP and GenAI algorithms. The associated paper also presents a systematic method of sharing a training corpus, where the data structure, such as folder and file names, makes it convenient to interact with the data programmatically.

Authors

  • Santo, Farhan Tanvir
  • Puch-Solis, Roberto
  • Le Gall, Maël
  • Cole, Christian
  • Haynes, David
  • NicDaeid, Niamh
0 Citations
0 Mentions
35% FAIR
0.2 Dataset Index
10.15132/10000252
January 2024