Version 2.1

MIMIC-IV-Note: Deidentified free-text clinical notes

Johnson, Alistair; Pollard, Tom; Horng, Steven; Celi, Leo Anthony; Mark, Roger

Description

The advent of large, open access text databases has driven advances in state-of-the-art model performance in natural language processing (NLP). The relatively limited amount of clinical data available for NLP has been cited as a significant barrier to the field's progress. Here we describe MIMIC-IV-Note: a collection of deidentified free-text clinical notes for patients included in the MIMIC-IV clinical database. MIMIC-IV-Note contains 357,289 deidentified discharge summaries from 161,403 patients admitted to the hospital and emergency department at the Beth Israel Deaconess Medical Center in Boston, MA, USA. The database also contains 2,471,881 deidentified radiology reports for 256,400 patients. All notes have had protected health information removed in accordance with the Health Insurance Portability and Accountability Act (HIPAA) Safe Harbor provision. All notes are linkable to MIMIC-IV, providing important context to the clinical data therein. The database is intended to stimulate research in clinical natural language processing and associated areas.

Metrics

Dataset Index

1.8

FAIR Score

73%

Citations

0

Mentions

0

Publication Details

DOI

Publisher

PhysioNet

Assigned Domain

Subfield

Geriatrics and Gerontology

Field

Medicine

Domain

Health Sciences

Confidence Score

41%

Source

Scholar Data Model

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00