Published on 01 January 2017

Outcomes of SAVE-SD 2015 and 2016 questionnaires on RASH and analysis of RDF annotations in the RASH papers.

View Dataset
Osborne, Francesco;Peroni, Silvio

Description

This dataset contains all the source materials and the data collected for the evaluation of RASH (Research Articles in Simplified HTML), which is presented in the paper:
Peroni, S., Osborne, F., Di Iorio, A., Nuzzolese, A., Poggi, F., Vitali, F., Motta, E. (2017). Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles. https://w3id.org/people/essepuntato/papers/rash-peerj2016.html
submitted to the PeerJ Computer Science.

In particular this archive contains seven items:- the file "README.txt" (this file);- the directory "save-sd2015", containing the source of six RASH articles submitted to SAVE-SD 2015 and their related RDF statements extracted from them and stored as Turtle files;- the directory "save-sd2016", containing the source of five RASH articles submitted to SAVE-SD 2015 and their related RDF statements extracted from them and stored as Turtle files;- the directory "stats", containing CSVf files with data about the RDF statements extracted from the RASH papers presented in the two edition of SAVE-SD.- the directory "script" containing the Python scripts creating the CSV files;- the file "script.sh" that runs the computation for creating all the statistics stored in the directory "stats";- the directory "questionnaires", containing four CSV files reporting the questionnaires filled in by authors and reviewers of RASH papers published in the SAVE-SD 2015 and SAVE-SD 2016 workshops.
In particular, the "stats" directory contains two directories describing the data about the statements of the RASH papers of SAVE-SD 2015 ("2015") and of SAVE-SD 2016 ("2016"). A summary of all these data is provided in the directory "tot".
Each of these directories contains four distinct CSV files:- "stats_short.csv" contains all the numeric data related to the vocabularies used in the statements and the way they have been involved in the RASH papers;- "stats.csv" extends the previous CSV file by adding also all the information related to each of the entities involved per vocabulary;- "stats_perc.csv" contains the percentages of the vocabularies used in the statements and the way they have been involved in the RASH papers;- "stats_short_perc.csv" extends the previous CSV file by adding also all the information related to each of the entities involved per vocabulary.
In the CSV tables, the last columns are dedicated to some metrics calculated starting from the values specified for each vocabulary/entity involved in the papers. In particular:- "TOTAL" is the sum of all the statements indicated in a row;- "mean" is the arithmetic mean of all the statements indicated in a row;- "std" is the standard deviation of related to the arithmetic mean;- "sqrt" is the sum of all the square root values of all the statements indicated in a row;- "log" is the sum of all the natural logarithm values of all the statements indicated in a row.

For any question about the data please contact [email protected] or [email protected]

Citations (7)

Mentions (0)

Metrics

Dataset Index

4.3

FAIR Score

81%

Citations

7

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

figshare

Assigned Domain

Subfield

Artificial Intelligence

Field

Computer Science

Domain

Physical Sciences

Confidence Score

87%

Source

Open Alex

Keywords

Computer SoftwareFOS: Computer and information sciences80602 Computer-Human InteractionFOS: PsychologyData FormatLibrary and Information StudiesFOS: Media and communications80404 Markup Languages80306 Open Software80505 Web Technologies (excl. Web Search)

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00