Automated Author ProfileJana Šindlerová
Jana Šindlerová
Current S-Index
Sum of Dataset Indices for all datasets
Average Dataset Index per Dataset
Average Dataset Index per dataset
Total Datasets
Total datasets for this author
Average FAIR Score
Average FAIR Score per dataset
Total Citations
Total citations to the author's datasets
Total Mentions
Total mentions of the author's datasets
S-Index Interpretation
The S-Index (Sharing Index) is a comprehensive metric that represents the cumulative impact of all your datasets. It is calculated as the sum of Dataset Index scores across all your claimed datasets.
What it means:
- A higher S-index indicates greater overall impact of your datasets relative to typical datasets in their fields of research
- The S-Index grows as you add more datasets or as existing datasets gain more citations and mentions
- It provides a single number to track your research data impact over time
Current S-Index: 0.8 (sum of 1 dataset Dataset Index scores)
More information here.
S-Index Over Time
Cumulative Citations Over Time
Cumulative Mentions Over Time
Datasets
Introduction
Prague Czech-English Dependency Treebank (PCEDT) 2.0 was developed by the Institute of Formal and Applied Linguistics at Charles University in Prague, Czech Republic. It is a corpus of Czech-English parallel resources translated, aligned and manually annotated for dependency structure, semantic labeling, argument structure, ellipsis and anaphora resolution. This release updates Prague Czech-English Dependency Treebank 1.0 (LDC2004T25) by adding English newswire texts so that it now contains over two million words in close to 100,000 sentences.
Data
The principal new material in PCEDT 2.0 is the inclusion of the entire Wall Street Journal data from Treebank-3 (LDC99T42). Not included from PCEDT 1.0 are the Readers Digest material, the Czech monolingual corpus, and the English-Czech dictionary.
Each section is enhanced with a comprehensive manual linguistic annotation in the Prague Dependency Treebank style (LDC2006T01, Prague Dependency Treebank 2.0). The main features of this annotation style are:
- dependency structure of the content words and coordinating and similar structures (function words are attached as their attribute values)
- semantic labeling of content words and types of coordinating structures
- argument structure, including an argument structure (valency) lexicon for both languages
- ellipsis and anaphora resolution
This annotation style is called tectogrammatical annotation, and it constitutes the tectogrammatical layer in the corpus.
Please consult the PCEDT website for more information and documentation.
Samples
Please follow this link for a sample of the data included.
Updates
None at this time.
Portions © 1987-1989 Dow Jones & Company, Inc., © 2002-2012 Charles University in Prague, Institute of Formal and Applied Linguistics, © 1999, 2004, 2012 Trustees of the University of Pennsylvania
Authors
- Eva Hajičová ;
- Jarmila Panevová ;
- Sgall, Petr ;
- Silvie Cinková ;
- Eva Fučíková ;
- Marie Mikulová ;
- Pajas, Petr ;
- Popelka, Jan ;
- Jiří Semecký ;
- Jana Šindlerová ;
- Jan Štěpánek ;
- Toman, Josef ;
- Zdeňka Urešová ;
- Zdeněk Žabokrtský ;
- Hajič, Jan