Version latest

MIMIC-IV-Note: Deidentified free-text clinical notes

View Dataset
Johnson, Alistair;Pollard, Tom;Horng, Steven;Celi, Leo Anthony;Mark, Roger

Description

The advent of large, open access text databases has driven advances in state-of-the-art model performance in natural language processing (NLP). Therelatively limited amount of clinical data available for NLP has been cited asa significant barrier to the field's progress. Here we describe MIMIC-IV-Note:a collection of deidentified free-text clinical notes for patients included inthe MIMIC-IV clinical database. MIMIC-IV-Note contains 331,794 deidentifieddischarge summaries from 145,915 patients admitted to the hospital andemergency department at the Beth Israel Deaconess Medical Center in Boston,MA, USA. The database also contains 2,321,355 deidentified radiology reportsfor 237,427 patients. All notes have had protected health information removedin accordance with the Health Insurance Portability and Accountability Act(HIPAA) Safe Harbor provision. All notes are linkable to MIMIC-IV providingimportant context to the clinical data therein. The database is intended tostimulate research in clinical natural language processing and associatedareas.

Citations (0)

Mentions (0)

Metrics

Dataset Index

0.8

FAIR Score

73%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

PhysioNet

Assigned Domain

Subfield

Plant Science

Field

Agricultural and Biological Sciences

Domain

Life Sciences

Confidence Score

50%

Source

Open Alex

Normalization Factors

FT

30.77

CTw

1.00

MTw

1.00