PastReader 2025

View Dataset
Montejo-Ráez, Arturo;Sánchez Nogales, Elena;Expósito Álvarez, Gloria;Ureña López, Alfonso;Martín-Valdivia, M. Teresa;Collado-Montañez, Jaime;Cabrera de Castro, Isabel;Cantero Romero, María Victoria;García Serrano, Ana;Ortuño Casanova, Rocío;Torterolo Orta, Yanco Amor

Description

This is the dataset used in the PastReader 2025 shared task at IberLEF 2025.This dataset has been generated from the historical press publications in the public domain, digitized by the National Library of Spain (BNE) and available for free access through Hemeroteca Digital. The BNE has undertaken efforts to improve these resulting texts through various approaches. One of the most significant initiatives involves open and collaborative OCR correction (among other types of projects) through the ComunidadBNE platform: https://comunidad.bne.es/.

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.7

FAIR Score

69%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Sociology and Political Science

Field

Social Sciences

Domain

Social Sciences

Confidence Score

35%

Source

Scholar Data Model

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00