Published on 01 January 2025

Supporting data for “Words in Minds and Machines: A Computational Characterization of Chinese Mental Lexicon”

View Dataset
Wang, Tianqi

Description

Supporting data for “Words in Minds and Machines: A Computational Characterization of Chinese Mental Lexicon” are made available on HKU DataHub. In this research project, I took advantage of distributional semantic models to address the central question of how we mentally represent lexical semantic information. With a special focus on Chinese, I investigated a number of research topics including (i) compound compositionality, (ii) lexical ambiguity, and (iii) the extrapolation of semantic rating norms (i.e., valence, arousal, and concreteness) that could reflect individual differences. As no human participants were recruited for the purpose of the series of studies, only data derived from computational models, as well as the codes to implement the computations are provided. Researchers can utilize the data and codes to replicate the results described in this thesis. For details of the supporting data, please refer to the README file.

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.2

FAIR Score

56%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

HKU Data Repository

Assigned Domain

Subfield

Language and Linguistics

Field

Arts and Humanities

Domain

Social Sciences

Confidence Score

59%

Source

Scholar Data Model

Keywords

Psycholinguistics (incl. speech production and comprehension)Natural language processingComputational linguisticsCognitive and computational psychology not elsewhere classifiedLexicography and semantics

Normalization Factors

FT

15.38

CTw

1.00

MTw

1.00