Published on 13 May 2024 |

Version 1.1

Publication dates for ArXiv publication versions

View Dataset
Druskat, Stephan

Description

Lookup tables in plain JSON, mapping ArXiv publication version identifiers to their respective publications dates.The JSON files are archived in arxiv-publication-dates-by-identifier-prefix.tar.gz.The archive contains files named after the date prefix of the ArXiv publication version identifiers they contain.E.g., the file 1908.json will contain the data for identifiers 1908.12345v1, 1908.12345v2, 1908.23456v1, etc.Publication dates are given in the format YYYY-MM-DD.ReproducibilityThe Snakemake workflow that has produced this dataset has been archived and is available in arxiv-publication-dates-workflow.tar.gz.Changes in version 1.1For version 1.1, the dataset was extended manually to include a single missing date for arXiv:0906.3421v3: 2010-02-02. As of 2024-05-13, the date for the respective version had not been provided in the arXivRaw OAI-PMH data (http://export.arxiv.org/oai2?verb=GetRecord&identifier=oai:arXiv.org:0906.3421&metadataPrefix=arXivRaw).Running the workflowTo reproduce the dataset on a Linux machine, you need a version of the conda package manager installed on your system.Run the following:# Extract the archived workflowtar -xf my-workflow.tar.gz# Create conda environment from lock fileconda env create -n arxiv-metadata --file conda-environment.lock.yaml# Activate the environmentconda activate arxiv-metadata# Optionally, dry-run the workflowsnakemake -n# Produce the output filessnakemake --keep-storage-local-copies --software-deployment-method conda -c Then, append the file 0906.json (included in the tar.gz output) with value 2010-02-02 for a new key 0906.3421v3.WorkflowTo adapt/change the workflow, clone it from https://github.com/sdruskat/arxiv-publication-metadata.The workflow version used to produce this dataset is available at https://doi.org/10.5281/zenodo.11091617.

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.5

FAIR Score

69%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Cellular and Molecular Neuroscience

Field

Neuroscience

Domain

Life Sciences

Confidence Score

54%

Source

Open Alex

Normalization Factors

FT

15.38

CTw

1.00

MTw

1.00