Automated Organization ProfileVavilov Institute of General Genetics
Vavilov Institute of General Genetics
Current S-Index
Sum of Dataset Indices for all datasets
Average Dataset Index per Dataset
Average Dataset Index per dataset
Total Datasets
Total datasets in this organization
Average FAIR Score
Average FAIR Score per dataset
Total Citations
Total citations to the organization's datasets
Total Mentions
Total mentions of the organization's datasets
S-Index Interpretation
The S-Index (Sharing Index) is a comprehensive metric that represents the cumulative impact of all your datasets. It is calculated as the sum of Dataset Index scores across all your claimed datasets.
What it means:
- A higher S-index indicates greater overall impact of your datasets relative to typical datasets in their fields of research
- The S-Index grows as you add more datasets or as existing datasets gain more citations and mentions
- It provides a single number to track your research data impact over time
Current S-Index: 32.9 (sum of 44 datasets Dataset Index scores)
More information here.
S-Index Over Time
Cumulative Citations Over Time
Cumulative Mentions Over Time
Datasets
Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: finalized files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.
Authors
- Ivan Kulakovskiy ;
- Timothy Hughes ;
- Codebook/GRECO-BIT Consortium ;
- Vorontsov, Ilya
Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: finalized files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.
Authors
- Ivan Kulakovskiy ;
- Timothy Hughes ;
- Codebook/GRECO-BIT Consortium ;
- Vorontsov, Ilya
This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).In the last version, we have resolved remaning minor issues with motif metadata and metrics, and included top1-top20 motifs and logos, according to the MEX rankings.
Authors
- Kulakovskiy, Ivan V. ;
- Makeev, Vsevolod J. ;
- Grosse, Ivo ;
- Grau, Jan ;
- Bucher, Philipp ;
- Kolpakov, Fedor ;
- Deplancke, Bart ;
- Hughes, Timothy ;
- Vorontsov, Ilya ;
- Kozin, Ivan ;
- Codebook/GRECO-BIT Consortium
This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).In the last version, we have resolved remaning minor issues with motif metadata and metrics, and included top1-top20 motifs and logos, according to the MEX rankings.
Authors
- Kulakovskiy, Ivan V. ;
- Makeev, Vsevolod J. ;
- Grosse, Ivo ;
- Grau, Jan ;
- Bucher, Philipp ;
- Kolpakov, Fedor ;
- Deplancke, Bart ;
- Hughes, Timothy ;
- Vorontsov, Ilya ;
- Kozin, Ivan ;
- Codebook/GRECO-BIT Consortium
This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.*The version update resolves minor file naming issues in the original upload. The files contents remain the same.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).
Authors
- Kulakovskiy, Ivan V. ;
- Makeev, Vsevolod J. ;
- Grosse, Ivo ;
- Grau, Jan ;
- Bucher, Philipp ;
- Kolpakov, Fedor ;
- Deplancke, Bart ;
- Hughes, Timothy ;
- Vorontsov, Ilya ;
- Kozin, Ivan ;
- Codebook/GRECO-BIT Consortium
Accompanying data for 10.5281/zenodo.8327373, Codebook GHT-SELEX, genomic peaks and subsampled reads. Update 2025: final update for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.
Authors
- Ivan Kulakovskiy ;
- Timothy Hughes ;
- Codebook/GRECO-BIT Consortium ;
- Vorontsov, Ilya
Accompanying data for 10.5281/zenodo.8327373, Codebook GHT-SELEX, genomic peaks and subsampled reads. Update 2025: final update for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.
Authors
- Ivan Kulakovskiy ;
- Timothy Hughes ;
- Codebook/GRECO-BIT Consortium ;
- Vorontsov, Ilya
Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.
Authors
- Ivan Kulakovskiy ;
- Timothy Hughes ;
- Codebook/GRECO-BIT Consortium ;
- Vorontsov, Ilya
This repository contains preprocessed Codebook-MEX data suitable for training and testing advanced models of transcription factor binding sites, see README for details.These data are derived from the ChIP-Seq and genomic HT-SELEX datasets of the primary MEX data release (ZENODO doi:10.5281/zenodo.8327372). Extra:ARCHIPELAGO_models.zip: the ArChIPelago models trained on genomic HT-SELEX and ChIP-Seq data for 139 TFs. Please refer to 5_Archipelago_sav_models_loading_and_scoring.ipynb in the ArChIPelago GitHub repo (https://github.com/autosome-ru/MEX-ArChIPelago) for details on using the model for scanning arbitrary DNA sequences. * Latest update: Archipelago models and underlying data were removed for three TFs (TIGD4, TTF1, ZBED5), which were partially affected by mislabeled samples in the original upload.
Authors
- Penzar, Dmitry ;
- Kravchenko, Pavel ;
- Kozin, Ivan ;
- Kulakovskiy, Ivan
This repository contains preprocessed Codebook-MEX data suitable for training and testing advanced models of transcription factor binding sites, see README for details.These data are derived from the ChIP-Seq and genomic HT-SELEX datasets of the primary MEX data release (ZENODO doi:10.5281/zenodo.8327372). Extra:ARCHIPELAGO_models.zip: the ArChIPelago models trained on genomic HT-SELEX and ChIP-Seq data for 139 TFs. Please refer to 5_Archipelago_sav_models_loading_and_scoring.ipynb in the ArChIPelago GitHub repo (https://github.com/autosome-ru/MEX-ArChIPelago) for details on using the model for scanning arbitrary DNA sequences. * Latest update: Archipelago models and underlying data were removed for three TFs (TIGD4, TTF1, ZBED5), which were partially affected by mislabeled samples in the original upload.
Authors
- Penzar, Dmitry ;
- Kravchenko, Pavel ;
- Kozin, Ivan ;
- Kulakovskiy, Ivan