Automated Author Profile

Li, Yue

McGill University

Current S-Index

3.5

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

1.2

Average Dataset Index per dataset

Total Datasets

3

Total datasets for this author

Average FAIR Score

54.5%

Average FAIR Score per dataset

Total Citations

0

Total citations to the author's datasets

Total Mentions

0

Total mentions of the author's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

LD matrices from the White British cohort in the UK Biobank in Zarr format (Version: 2.0)

This dataset contains the Linkage Disequilibrium (LD) matrices that were used in the analyses described in the manuscript: Fast and Accurate Bayesian Polygenic Risk Modeling with Variational Inference
Shadi Zabad, Simon Gravel, Yue Li
McGill University LD matrices record the SNP-by-SNP correlations in a given sample of individuals from a general population. In this case, we threshold the matrices so that we only record the correlations between SNPs that are at most 3 centi Morgan apart. These matrices record the SNP correlations in a random sample of 50,000 individuals from the White British cohort in the UK Biobank dataset. There is one matrix per autosomal chromosome (chr_1, chr_2, ..., chr_22). The matrices are stored in Zarr format, a chunked on-disk array storage format that allows for multi-threaded read and write access. To access these matrices, consult the codebase of magenpy, our custom python package with special data structures for processing these LD matrices. UPDATE (03/09/2022): We updated the matrices to add the reference allele attribute (A2) and we also now have one tar archive per chromosome.

Authors

  • Zabad, Shadi ;
  • Gravel, Simon ;
  • Li, Yue
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.6529228September 2022

LD matrices from the White British cohort in the UK Biobank in Zarr format (Version: 2.0)

This dataset contains the Linkage Disequilibrium (LD) matrices that were used in the analyses described in the manuscript: Fast and Accurate Bayesian Polygenic Risk Modeling with Variational Inference
Shadi Zabad, Simon Gravel, Yue Li
McGill University LD matrices record the SNP-by-SNP correlations in a given sample of individuals from a general population. In this case, we threshold the matrices so that we only record the correlations between SNPs that are at most 3 centi Morgan apart. These matrices record the SNP correlations in a random sample of 50,000 individuals from the White British cohort in the UK Biobank dataset. There is one matrix per autosomal chromosome (chr_1, chr_2, ..., chr_22). The matrices are stored in Zarr format, a chunked on-disk array storage format that allows for multi-threaded read and write access. To access these matrices, consult the codebase of magenpy, our custom python package with special data structures for processing these LD matrices. UPDATE (03/09/2022): We updated the matrices to add the reference allele attribute (A2) and we also now have one tar archive per chromosome.

Authors

  • Zabad, Shadi ;
  • Gravel, Simon ;
  • Li, Yue
0 Citations0 Mentions73% FAIR1.6 Dataset Index
10.5281/zenodo.7036625September 2022

LD matrices from the White British cohort in the UK Biobank in Zarr format (Version: 1.0)

This dataset contains the Linkage Disequilibrium (LD) matrices that were used in the analyses described in the manuscript: Fast and Accurate Bayesian Polygenic Risk Modeling with Variational Inference
Shadi Zabad, Simon Gravel, Yue Li
McGill University LD matrices record the SNP-by-SNP correlations in a given sample of individuals from a general population. In this case, we threshold the matrices so that we only record the correlations between SNPs that are at most 3 centi Morgan apart. These matrices record the SNP correlations in a random sample of 50,000 individuals from the White British cohort in the UK Biobank dataset. There is one matrix per autosomal chromosome (chr_1, chr_2, ..., chr_22). The matrices are stored in Zarr format, a chunked on-disk array storage format that allows for multi-threaded read and write access. To access these matrices, consult the codebase of magenpy, our custom python package with special data structures for processing these LD matrices.

Authors

  • Zabad, Shadi ;
  • Gravel, Simon ;
  • Li, Yue
0 Citations0 Mentions77% FAIR1.7 Dataset Index
10.5281/zenodo.6529229May 2022