Published on 25 October 2021 |

Version 0.1

German CBOW FastText embeddings with min count 250

View Dataset
Bocharov, Victor

Description

FastText embeddings built from Common Crawl german dataset Parameters Parameters Value(s) Dimensions 256 and 384 Context window 5 Negative sampled 10 Epochs 1 Number of buckets 131072 or 262144 Min n 3 Max n 6

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.8

FAIR Score

73%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Sociology and Political Science

Field

Social Sciences

Domain

Social Sciences

Confidence Score

39%

Source

Open Alex

Keywords

Word embeddingsfasttextCommon CrawlGerman

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00