Automated Organization Profile

Vavilov Institute of General Genetics

Current S-Index

32.9

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

0.7

Average Dataset Index per dataset

Total Datasets

44

Total datasets in this organization

Average FAIR Score

75.3%

Average FAIR Score per dataset

Total Citations

6

Total citations to the organization's datasets

Total Mentions

0

Total mentions of the organization's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Codebook MEX Supplementary Dataset 2

Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: finalized files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.

Authors

  • Ivan Kulakovskiy ;
  • Timothy Hughes ;
  • Codebook/GRECO-BIT Consortium ;
  • Vorontsov, Ilya
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.156202522025

Codebook MEX Supplementary Dataset 2 (Version: MEX2023.1)

Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: finalized files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.

Authors

  • Ivan Kulakovskiy ;
  • Timothy Hughes ;
  • Codebook/GRECO-BIT Consortium ;
  • Vorontsov, Ilya
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.83274762025

Codebook Motif Explorer Supplementary Dataset (Version: MEX2023.2)

This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).In the last version, we have resolved remaning minor issues with motif metadata and metrics, and included top1-top20 motifs and logos, according to the MEX rankings.

Authors

  • Kulakovskiy, Ivan V. ;
  • Makeev, Vsevolod J. ;
  • Grosse, Ivo ;
  • Grau, Jan ;
  • Bucher, Philipp ;
  • Kolpakov, Fedor ;
  • Deplancke, Bart ;
  • Hughes, Timothy ;
  • Vorontsov, Ilya ;
  • Kozin, Ivan ;
  • Codebook/GRECO-BIT Consortium
2 Citations0 Mentions77% FAIR1.5 Dataset Index
10.5281/zenodo.83273722025

Codebook Motif Explorer Supplementary Dataset

This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).In the last version, we have resolved remaning minor issues with motif metadata and metrics, and included top1-top20 motifs and logos, according to the MEX rankings.

Authors

  • Kulakovskiy, Ivan V. ;
  • Makeev, Vsevolod J. ;
  • Grosse, Ivo ;
  • Grau, Jan ;
  • Bucher, Philipp ;
  • Kolpakov, Fedor ;
  • Deplancke, Bart ;
  • Hughes, Timothy ;
  • Vorontsov, Ilya ;
  • Kozin, Ivan ;
  • Codebook/GRECO-BIT Consortium
0 Citations0 Mentions79% FAIR0.1 Dataset Index
10.5281/zenodo.156678052025

Codebook Motif Explorer Supplementary Dataset

This is the supplementary dataset of the GRECO-BIT/Codebook Motif Explorer (MEX),which is a detailed online catalog of DNA motifs built using a huge collection of experimental data produced by the Codebook Consortium to annotate the binding specificities of human transcription factors, see Vorontsov et al. (bioRxiv).Motif discovery for MEX was performed with multiple software tools, and the resulting motifs were rigorously benchmarked against held-out test data. Here you can find detailed benchmarking results, complete motif sets, and the benchmarking-ready preprocessed experimental data. Further details are provided in README.*The version update resolves minor file naming issues in the original upload. The files contents remain the same.Note: There are two extra accompanying Zenodo datasets, namely,doi:10.5281/zenodo.8327477 (MEX HT-SELEX data) and doi:10.5281/zenodo.8327970 (MEX GHT-SELEX data). Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.Note: PBM and SMS datasets were not affected and the respective files remain the same as in the previous upload.This repository now also provides the set of reference artifact motifs used to filter the MEX results in a variety of commonly accepted formats (TRANSFAC, MEME, JASPAR, plain text).

Authors

  • Kulakovskiy, Ivan V. ;
  • Makeev, Vsevolod J. ;
  • Grosse, Ivo ;
  • Grau, Jan ;
  • Bucher, Philipp ;
  • Kolpakov, Fedor ;
  • Deplancke, Bart ;
  • Hughes, Timothy ;
  • Vorontsov, Ilya ;
  • Kozin, Ivan ;
  • Codebook/GRECO-BIT Consortium
0 Citations0 Mentions79% FAIR0.1 Dataset Index
10.5281/zenodo.156034892025

Codebook MEX Supplementary Dataset 3

Accompanying data for 10.5281/zenodo.8327373, Codebook GHT-SELEX, genomic peaks and subsampled reads. Update 2025: final update for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.

Authors

  • Ivan Kulakovskiy ;
  • Timothy Hughes ;
  • Codebook/GRECO-BIT Consortium ;
  • Vorontsov, Ilya
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.156199912025

Codebook MEX Supplementary Dataset 3 (Version: MEX2023.1)

Accompanying data for 10.5281/zenodo.8327373, Codebook GHT-SELEX, genomic peaks and subsampled reads. Update 2025: final update for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.

Authors

  • Ivan Kulakovskiy ;
  • Timothy Hughes ;
  • Codebook/GRECO-BIT Consortium ;
  • Vorontsov, Ilya
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.83279692025

Codebook MEX Supplementary Dataset 2

Accompanying data for 10.5281/zenodo.8327373, Codebook HT-SELEX, subsampled reads. Update 2025: replaced files for TFs whose datasets were partially mislabeled in the original upload.Affected TFs: CHCHD3, CPXCR1, MYRFL, PRDM5, TIGD4, TIGD5, TTF1, ZBED5, ZBTB5, ZNF578.

Authors

  • Ivan Kulakovskiy ;
  • Timothy Hughes ;
  • Codebook/GRECO-BIT Consortium ;
  • Vorontsov, Ilya
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.156042022025

MEX-Archipelago data (Version: 2024.1)

This repository contains preprocessed Codebook-MEX data suitable for training and testing advanced models of transcription factor binding sites, see README for details.These data are derived from the ChIP-Seq and genomic HT-SELEX datasets of the primary MEX data release (ZENODO doi:10.5281/zenodo.8327372).  Extra:ARCHIPELAGO_models.zip: the ArChIPelago models trained on genomic HT-SELEX and ChIP-Seq data for 139 TFs. Please refer to 5_Archipelago_sav_models_loading_and_scoring.ipynb in the ArChIPelago GitHub repo (https://github.com/autosome-ru/MEX-ArChIPelago) for details on using the model for scanning arbitrary DNA sequences. * Latest update: Archipelago models and underlying data were removed for three TFs (TIGD4, TTF1, ZBED5), which were partially affected by mislabeled samples in the original upload.

Authors

  • Penzar, Dmitry ;
  • Kravchenko, Pavel ;
  • Kozin, Ivan ;
  • Kulakovskiy, Ivan
2 Citations0 Mentions73% FAIR1.0 Dataset Index
10.5281/zenodo.105153072025

MEX-Archipelago data

This repository contains preprocessed Codebook-MEX data suitable for training and testing advanced models of transcription factor binding sites, see README for details.These data are derived from the ChIP-Seq and genomic HT-SELEX datasets of the primary MEX data release (ZENODO doi:10.5281/zenodo.8327372).  Extra:ARCHIPELAGO_models.zip: the ArChIPelago models trained on genomic HT-SELEX and ChIP-Seq data for 139 TFs. Please refer to 5_Archipelago_sav_models_loading_and_scoring.ipynb in the ArChIPelago GitHub repo (https://github.com/autosome-ru/MEX-ArChIPelago) for details on using the model for scanning arbitrary DNA sequences. * Latest update: Archipelago models and underlying data were removed for three TFs (TIGD4, TTF1, ZBED5), which were partially affected by mislabeled samples in the original upload.

Authors

  • Penzar, Dmitry ;
  • Kravchenko, Pavel ;
  • Kozin, Ivan ;
  • Kulakovskiy, Ivan
0 Citations0 Mentions79% FAIR0.1 Dataset Index
10.5281/zenodo.155552512025