Published on 01 January 2021

Corpus CSV

View Dataset
Friedrich Kuwaki, Vinicius Takeo

Description

This file describes the corpus in a CSV format using a comma as separator. The file includes the following columns:
- en: The words in English that composes the sentence;- pt_br: The words in Portuguese that composes the sentence;- type: The type of the sentence (OBJ for objective and SUBJ for subjective);- pol: The polarity of the sentence if it is a subjective sentence (-1, 0 or 1).- en_path: The path in OpenSubtitles related to the sentence in English;- pt_br_path: The path in OpenSubtitles related to the sentence in Portuguese;

Citations (0)

Mentions (0)

Metrics

Dataset Index

0.3

FAIR Score

13%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

figshare

Assigned Domain

Subfield

Language and Linguistics

Field

Arts and Humanities

Domain

Social Sciences

Confidence Score

42%

Source

Scholar Data Model

Keywords

80107 Natural Language ProcessingFOS: Computer and information sciences

Normalization Factors

FT

15.38

CTw

1.00

MTw

1.00