Published on 23 October 2020

CORD-19 SciSpaCy Entity Dataset

View Dataset
Pal, Sujit

Description

Dataset of biomedical entities extracted from the CORD-19 dataset (2020-08-28 and 2020-09-28) using trained NER (trained against CRAFT, JNLPBA, BC5CDR, and BioNLP) and NERL models (UMLS, MeSH, GO, HPO, and RxNorm) from the SciSpaCy project, provided as structured Parquet files. Dataset may be useful for downstream tasks around entity linking and relationship extraction. The work was carried out using Dask on the Saturn Cloud platform, and was a joint effort between Elsevier Labs and Saturn Cloud. Dataset available at: s3://els-labs-website/cord19-scispacy-entities/

Citations (0)

Mentions (3)

Metrics

Dataset Index

2.9

FAIR Score

65%

Citations

0

Mentions

3

Metrics Over Time

Publication Details

DOI

Publisher

Mendeley

Assigned Domain

Subfield

Radiology, Nuclear Medicine and Imaging

Field

Medicine

Domain

Health Sciences

Confidence Score

93%

Source

Open Alex

Keywords

Natural Language ProcessingBiomedical Discipline

Normalization Factors

FT

15.38

CTw

1.00

MTw

1.00