Automated Author ProfileMarkowitz, Sanford
Markowitz, Sanford
Current S-Index
Sum of Dataset Indices for all datasets
Average Dataset Index per Dataset
Average Dataset Index per dataset
Total Datasets
Total datasets for this author
Average FAIR Score
Average FAIR Score per dataset
Total Citations
Total citations to the author's datasets
Total Mentions
Total mentions of the author's datasets
S-Index Interpretation
The S-Index (Sharing Index) is a comprehensive metric that represents the cumulative impact of all your datasets. It is calculated as the sum of Dataset Index scores across all your claimed datasets.
What it means:
- A higher S-index indicates greater overall impact of your datasets relative to typical datasets in their fields of research
- The S-Index grows as you add more datasets or as existing datasets gain more citations and mentions
- It provides a single number to track your research data impact over time
Current S-Index: 15.7 (sum of 24 datasets Dataset Index scores)
More information here.
S-Index Over Time
Cumulative Citations Over Time
Cumulative Mentions Over Time
Datasets
Colorectal cancer (CRC) is a leading cause of mortality worldwide. We conducted a genome-wide association study meta-analysis of 100,204 CRC cases and 154,587 controls of European and Asian ancestry, identifying 205 independent risk associations, of which 50 were unreported. We performed integrative genomic, transcriptomic and methylomic analyses across large bowel mucosa and other tissues. Transcriptome- and methylome-wide association studies revealed an additional 53 risk associations. We identified 155 high confidence effector genes functionally linked to CRC risk, many of which had no previously established role in CRC. These have multiple different functions, and specifically indicate that variation in normal colorectal homeostasis, proliferation, cell adhesion, migration, immunity and microbial interactions determines CRC risk. Cross-tissue analyses indicated that over a third of effector genes most likely act outside the colonic mucosa. Our findings provide insights into colorectal oncogenesis, and highlight potential targets across tissues for new CRC treatment and chemoprevention strategies. The data submitted here are expression and methylation models with LD reference data for the transcriptome-wide (TWAS), methylome-wide (MWAS) and transcript isoform-wide association study (TIsWAS) as described in the manuscript "Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and East Asian ancestries". Details of the methods are presented in the method section and supplementary information file. TWAS analysis Gene expression models for the six in-house expression datasets were generated using the PredictDB v7 pipeline for a total of 1,077 participants. Elastic net model building with 10-fold cross-validation was performed independently for each dataset. The elastic net models for GTEx v8 Colon Transverse were obtained from the PredictDB data repository (http://predictdb.org/) and had been generated using the same pipeline. Models were computed using HapMap2 SNPs ±1Mb from each gene, together with covariate factors estimated using PEER32, clinical covariates when appropriate (age, sex and, where appropriate, case-control status, type of polyp and anatomic location in the colorectum), and three PCs from the individual dataset’s SNP genotype data. Transcript-based TWAS analyses (TIsWAS) were likewise performed by using transcript-level data from the SOCCS, BarcUVa-Seq and GTEx Colon Transverse datasets. MWAS analysis Methylation beta values were calculated based on the manufacturer’s standard, ranging from 0 to 1. Quality control and data normalization were performed in R using the ChAMP software pipeline for the EPIC and 450K arrays. Briefly, we filtered out failed probes with detection P > 0.02 in >5% of samples, probes with <3 reads in >5% of samples per probe and all non-CpG probes. Samples with failed probes >0.1 were also excluded from downstream analyses. We discarded all probes with SNPs within 10bp of the interrogated CpG (from 1,000 Genomes Project, CEU population)34, and probes that ambiguously mapped to multiple locations in the human genome with up to two mismatches33. We only considered probes mapping to autosomes and those overlapping between the EPIC and the 450K arrays. Normalization was achieved using the Beta MIxture Quantile (BMIQ) method. Per probe methylation models were created using the PredictDB pipeline on the normalized methylation matrix and the genotypes as per TWAS eQTL analysis. To optimize power, we restricted our analysis to 263,341-238,443 (for the 450K array) and 377,678 (for the EPIC array) probes annotated to Islands, Shores and Shelves, and discarded “Open Sea” regions.
Authors
- Fernandez-Rozadilla, Ceres ;
- Timofeeva, Maria ;
- Zhishan Chen ;
- Law, Philip ;
- Minta Thomas ;
- Schmit, Stephanie ;
- Díez-Obrero, Virginia ;
- Hsu, Li ;
- Fernandez-Tajes, Juan ;
- Palles, Claire ;
- Sherwood, Kitty ;
- Briggs, Sarah ;
- Svinti, Victoria ;
- Donnelly, Kevin ;
- Farrington, Susan ;
- Blackmur, James ;
- Vaughan-Shaw, Peter ;
- Xiao-Ou Shu ;
- Jirong Long ;
- Qiuyin Cai ;
- Xingyi Guo ;
- Yingchang Lu ;
- Broderick, Peter ;
- Studd, James ;
- Huyghe, Jeroen ;
- Harrison, Tabitha ;
- Conti, David ;
- Dampier, Christopher ;
- Devall, Mathew ;
- Schumacher, Fredrick ;
- Melas, Marilena ;
- Rennert, Gad ;
- Obón-Santacana, Mireia ;
- Martín-Sánchez, Vicente ;
- Moratalla-Navarro, Ferran ;
- Oh, Jae Hwan ;
- Jeongseon Kim ;
- Jee, Sun Ha ;
- Jung, Keum Ji ;
- Sun-Seog Kweon ;
- Shin, Min-Ho ;
- Aesun Shin ;
- Ahn, Yoon-Ok ;
- Kim, Dong-Hyun ;
- Oze, Isao ;
- Wanqing Wen ;
- Keitaro Matsuo ;
- Matsuda, Koichi ;
- Tanikawa, Chizu ;
- Zefang Ren ;
- Yu-Tang Gao ;
- Jia, Wei-Hua ;
- Hopper, John ;
- Jenkins, Mark ;
- Aung Ko Win ;
- Rish Pai ;
- Figueiredo, Jane ;
- Haile, Robert ;
- Gallinger, Steven ;
- Woods, Michael ;
- Newcomb, Polly ;
- Duggan, David ;
- Cheadle, Jeremy ;
- Kaplan, Richard ;
- Maughan, Timothy ;
- Kerr, Rachel ;
- Kerr, David ;
- Kirac, Iva ;
- Böhm, Jan ;
- Lukka-Pekka Mecklin ;
- Jousilahti, Pekka ;
- Knekt, Paul ;
- Aaltonen, Lauri ;
- Rissanen, Harri ;
- Pukkala, Eero ;
- Eriksson, Johan ;
- Cajuso, Tatiana ;
- Hänninen, Ulrika ;
- Kondelin, Johanna ;
- Palin, Kimmo ;
- Tanskanen, Tomas ;
- Renkonen-Sinisalo, Laura ;
- Zanke, Brent ;
- Männistö, Satu ;
- Albanes, Demetrius ;
- Weinstein, Stephanie ;
- Ruiz-Narvaez, Edward ;
- Palmer, Julie ;
- Buchanan, Daniel ;
- Platz, Elizabeth ;
- Visvanathan, Kala ;
- Ulrich, Cornelia ;
- Siegel, Erin ;
- Brezina, Stefanie ;
- Gsur, Andrea ;
- Campbell, Peter ;
- Chang-Claude, Jenny ;
- Hoffmeister, Michael ;
- Brenner, Hermann ;
- Slattery, Martha ;
- Potter, John ;
- Tsilidis, Konstantinos ;
- Schulze, Matthias ;
- Gunter, Marc ;
- Murphy, Neil ;
- Castells, Antoni ;
- Castellví-Bel, Sergi ;
- Moreira, Leticia ;
- Arndt, Volker ;
- Shcherbina, Anna ;
- Stern, Mariana ;
- Bens Pardamean ;
- Bishop, Timothy ;
- Giles, Graham ;
- Southey, Melissa ;
- Idos, Gregory ;
- McDonnell, Kevin ;
- Zomoroda Abu-Ful ;
- Greenson, Joel ;
- Shulman, Katerina ;
- Lejbkowicz, Flavio ;
- Offit, Kenneth ;
- Su, Yu-Ru ;
- Steinfelder, Robert ;
- Temitope Keku ;
- Van Guelpen, Bethany ;
- Hudson, Thomas ;
- Hampel, Heather ;
- Pearlman, Rachel ;
- Berndt, Sonja ;
- Hayes, Richard ;
- Martinez, Marie Elena ;
- Thomas, Sushma ;
- Corley, Douglas ;
- Pharoah, Paul ;
- Larsson, Susanna ;
- Yen, Yun ;
- Heinz-Josef Lenz ;
- White, Emily ;
- Li, Li ;
- Doheny, Kimberly ;
- Pugh, Elizabeth ;
- Shelford, Tameka ;
- Chan, Andrew ;
- Cruz-Correa, Marcia ;
- Lindblom, Annika ;
- Hunter, David ;
- Joshi, Amit ;
- Schafmayer, Clemens ;
- Scacheri, Peter ;
- Anshul Kundaje ;
- Nickerson, Deborah ;
- Schoen, Robert ;
- Hampe, Jochen ;
- Zsofia Stadler ;
- Vodicka, Pavel ;
- Vodickova, Ludmila ;
- Vymetalkova, Veronika ;
- Papadopoulos, Nickolas ;
- Chistopher Edlund ;
- Gauderman, William ;
- Thomas, Duncan ;
- Shibata, David ;
- Toland, Amanda ;
- Markowitz, Sanford ;
- Kim, Andre ;
- Chanock, Stephen ;
- Franzel Van Duijnhoven ;
- Feskens, Edith ;
- Sakoda, Lori ;
- Gago-Dominguez, Manuela ;
- Wolk, Alicja ;
- Naccarati, Alessio ;
- Pardini, Barbara ;
- FitzGerald, Liesel ;
- Lee, Soo Chin ;
- Ogino, Shuji ;
- Bien, Stephanie ;
- Kooperberg, Charles ;
- Li, Christopher ;
- Lin, Yi ;
- Prentice, Ross ;
- Conghui Qu ;
- Bézieau, Stéphane ;
- Tangen, Catherine ;
- Mardis, Elaine ;
- Yamaji, Taiki ;
- Sawada, Norie ;
- Iwasaki, Motoki ;
- Haiman, Christopher ;
- Loic Le Marchand ;
- Wu, Anna ;
- Chenxu Qu ;
- McNeil, Caroline ;
- Coetzee, Gerhard ;
- Hayward, Caroline ;
- Deary, Ian ;
- Harris, Sarah ;
- Evropi Theodoratou ;
- Reid, Stuart ;
- Walker, Marion ;
- Ooi, Li Yin ;
- Moreno, Victor ;
- Casey, Graham ;
- Gruber, Stephen ;
- Tomlinson, Ian ;
- Zheng, Wei ;
- Dunlop, Malcolm ;
- Houlston, Richard ;
- Peters, Ulrike
Colorectal cancer (CRC) is a leading cause of mortality worldwide. We conducted a genome-wide association study meta-analysis of 100,204 CRC cases and 154,587 controls of European and Asian ancestry, identifying 205 independent risk associations, of which 50 were unreported. We performed integrative genomic, transcriptomic and methylomic analyses across large bowel mucosa and other tissues. Transcriptome- and methylome-wide association studies revealed an additional 53 risk associations. We identified 155 high confidence effector genes functionally linked to CRC risk, many of which had no previously established role in CRC. These have multiple different functions, and specifically indicate that variation in normal colorectal homeostasis, proliferation, cell adhesion, migration, immunity and microbial interactions determines CRC risk. Cross-tissue analyses indicated that over a third of effector genes most likely act outside the colonic mucosa. Our findings provide insights into colorectal oncogenesis, and highlight potential targets across tissues for new CRC treatment and chemoprevention strategies. The data submitted here are expression and methylation models with LD reference data for the transcriptome-wide (TWAS), methylome-wide (MWAS) and transcript isoform-wide association study (TIsWAS) as described in the manuscript "Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and East Asian ancestries". Details of the methods are presented in the method section and supplementary information file. TWAS analysis Gene expression models for the six in-house expression datasets were generated using the PredictDB v7 pipeline for a total of 1,077 participants. Elastic net model building with 10-fold cross-validation was performed independently for each dataset. The elastic net models for GTEx v8 Colon Transverse were obtained from the PredictDB data repository (http://predictdb.org/) and had been generated using the same pipeline. Models were computed using HapMap2 SNPs ±1Mb from each gene, together with covariate factors estimated using PEER32, clinical covariates when appropriate (age, sex and, where appropriate, case-control status, type of polyp and anatomic location in the colorectum), and three PCs from the individual dataset’s SNP genotype data. Transcript-based TWAS analyses (TIsWAS) were likewise performed by using transcript-level data from the SOCCS, BarcUVa-Seq and GTEx Colon Transverse datasets. MWAS analysis Methylation beta values were calculated based on the manufacturer’s standard, ranging from 0 to 1. Quality control and data normalization were performed in R using the ChAMP software pipeline for the EPIC and 450K arrays. Briefly, we filtered out failed probes with detection P > 0.02 in >5% of samples, probes with <3 reads in >5% of samples per probe and all non-CpG probes. Samples with failed probes >0.1 were also excluded from downstream analyses. We discarded all probes with SNPs within 10bp of the interrogated CpG (from 1,000 Genomes Project, CEU population)34, and probes that ambiguously mapped to multiple locations in the human genome with up to two mismatches33. We only considered probes mapping to autosomes and those overlapping between the EPIC and the 450K arrays. Normalization was achieved using the Beta MIxture Quantile (BMIQ) method. Per probe methylation models were created using the PredictDB pipeline on the normalized methylation matrix and the genotypes as per TWAS eQTL analysis. To optimize power, we restricted our analysis to 263,341-238,443 (for the 450K array) and 377,678 (for the EPIC array) probes annotated to Islands, Shores and Shelves, and discarded “Open Sea” regions.
Authors
- Fernandez-Rozadilla, Ceres ;
- Timofeeva, Maria ;
- Zhishan Chen ;
- Law, Philip ;
- Minta Thomas ;
- Schmit, Stephanie ;
- Díez-Obrero, Virginia ;
- Hsu, Li ;
- Fernandez-Tajes, Juan ;
- Palles, Claire ;
- Sherwood, Kitty ;
- Briggs, Sarah ;
- Svinti, Victoria ;
- Donnelly, Kevin ;
- Farrington, Susan ;
- Blackmur, James ;
- Vaughan-Shaw, Peter ;
- Xiao-Ou Shu ;
- Jirong Long ;
- Qiuyin Cai ;
- Xingyi Guo ;
- Yingchang Lu ;
- Broderick, Peter ;
- Studd, James ;
- Huyghe, Jeroen ;
- Harrison, Tabitha ;
- Conti, David ;
- Dampier, Christopher ;
- Devall, Mathew ;
- Schumacher, Fredrick ;
- Melas, Marilena ;
- Rennert, Gad ;
- Obón-Santacana, Mireia ;
- Martín-Sánchez, Vicente ;
- Moratalla-Navarro, Ferran ;
- Oh, Jae Hwan ;
- Jeongseon Kim ;
- Jee, Sun Ha ;
- Jung, Keum Ji ;
- Sun-Seog Kweon ;
- Shin, Min-Ho ;
- Aesun Shin ;
- Ahn, Yoon-Ok ;
- Kim, Dong-Hyun ;
- Oze, Isao ;
- Wanqing Wen ;
- Keitaro Matsuo ;
- Matsuda, Koichi ;
- Tanikawa, Chizu ;
- Zefang Ren ;
- Yu-Tang Gao ;
- Jia, Wei-Hua ;
- Hopper, John ;
- Jenkins, Mark ;
- Aung Ko Win ;
- Rish Pai ;
- Figueiredo, Jane ;
- Haile, Robert ;
- Gallinger, Steven ;
- Woods, Michael ;
- Newcomb, Polly ;
- Duggan, David ;
- Cheadle, Jeremy ;
- Kaplan, Richard ;
- Maughan, Timothy ;
- Kerr, Rachel ;
- Kerr, David ;
- Kirac, Iva ;
- Böhm, Jan ;
- Lukka-Pekka Mecklin ;
- Jousilahti, Pekka ;
- Knekt, Paul ;
- Aaltonen, Lauri ;
- Rissanen, Harri ;
- Pukkala, Eero ;
- Eriksson, Johan ;
- Cajuso, Tatiana ;
- Hänninen, Ulrika ;
- Kondelin, Johanna ;
- Palin, Kimmo ;
- Tanskanen, Tomas ;
- Renkonen-Sinisalo, Laura ;
- Zanke, Brent ;
- Männistö, Satu ;
- Albanes, Demetrius ;
- Weinstein, Stephanie ;
- Ruiz-Narvaez, Edward ;
- Palmer, Julie ;
- Buchanan, Daniel ;
- Platz, Elizabeth ;
- Visvanathan, Kala ;
- Ulrich, Cornelia ;
- Siegel, Erin ;
- Brezina, Stefanie ;
- Gsur, Andrea ;
- Campbell, Peter ;
- Chang-Claude, Jenny ;
- Hoffmeister, Michael ;
- Brenner, Hermann ;
- Slattery, Martha ;
- Potter, John ;
- Tsilidis, Konstantinos ;
- Schulze, Matthias ;
- Gunter, Marc ;
- Murphy, Neil ;
- Castells, Antoni ;
- Castellví-Bel, Sergi ;
- Moreira, Leticia ;
- Arndt, Volker ;
- Shcherbina, Anna ;
- Stern, Mariana ;
- Bens Pardamean ;
- Bishop, Timothy ;
- Giles, Graham ;
- Southey, Melissa ;
- Idos, Gregory ;
- McDonnell, Kevin ;
- Zomoroda Abu-Ful ;
- Greenson, Joel ;
- Shulman, Katerina ;
- Lejbkowicz, Flavio ;
- Offit, Kenneth ;
- Su, Yu-Ru ;
- Steinfelder, Robert ;
- Temitope Keku ;
- Van Guelpen, Bethany ;
- Hudson, Thomas ;
- Hampel, Heather ;
- Pearlman, Rachel ;
- Berndt, Sonja ;
- Hayes, Richard ;
- Martinez, Marie Elena ;
- Thomas, Sushma ;
- Corley, Douglas ;
- Pharoah, Paul ;
- Larsson, Susanna ;
- Yen, Yun ;
- Heinz-Josef Lenz ;
- White, Emily ;
- Li, Li ;
- Doheny, Kimberly ;
- Pugh, Elizabeth ;
- Shelford, Tameka ;
- Chan, Andrew ;
- Cruz-Correa, Marcia ;
- Lindblom, Annika ;
- Hunter, David ;
- Joshi, Amit ;
- Schafmayer, Clemens ;
- Scacheri, Peter ;
- Anshul Kundaje ;
- Nickerson, Deborah ;
- Schoen, Robert ;
- Hampe, Jochen ;
- Zsofia Stadler ;
- Vodicka, Pavel ;
- Vodickova, Ludmila ;
- Vymetalkova, Veronika ;
- Papadopoulos, Nickolas ;
- Chistopher Edlund ;
- Gauderman, William ;
- Thomas, Duncan ;
- Shibata, David ;
- Toland, Amanda ;
- Markowitz, Sanford ;
- Kim, Andre ;
- Chanock, Stephen ;
- Franzel Van Duijnhoven ;
- Feskens, Edith ;
- Sakoda, Lori ;
- Gago-Dominguez, Manuela ;
- Wolk, Alicja ;
- Naccarati, Alessio ;
- Pardini, Barbara ;
- FitzGerald, Liesel ;
- Lee, Soo Chin ;
- Ogino, Shuji ;
- Bien, Stephanie ;
- Kooperberg, Charles ;
- Li, Christopher ;
- Lin, Yi ;
- Prentice, Ross ;
- Conghui Qu ;
- Bézieau, Stéphane ;
- Tangen, Catherine ;
- Mardis, Elaine ;
- Yamaji, Taiki ;
- Sawada, Norie ;
- Iwasaki, Motoki ;
- Haiman, Christopher ;
- Loic Le Marchand ;
- Wu, Anna ;
- Chenxu Qu ;
- McNeil, Caroline ;
- Coetzee, Gerhard ;
- Hayward, Caroline ;
- Deary, Ian ;
- Harris, Sarah ;
- Evropi Theodoratou ;
- Reid, Stuart ;
- Walker, Marion ;
- Ooi, Li Yin ;
- Moreno, Victor ;
- Casey, Graham ;
- Gruber, Stephen ;
- Tomlinson, Ian ;
- Zheng, Wei ;
- Dunlop, Malcolm ;
- Houlston, Richard ;
- Peters, Ulrike
Dataset 4: DACOR1-mediated changes in Gene Body methylation. Dataset contains locations of all differentially methylated CpG sites for top ten genes of interest, as well as identification if a specific site is located in an intron or an exon. (XLSX 34 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 10: Localization of modified CpGs. Modified CpGs according to their localization in CpG islands, CpG islands shores or shelves. (XLSX 774 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 10: Localization of modified CpGs. Modified CpGs according to their localization in CpG islands, CpG islands shores or shelves. (XLSX 774 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 9: RRBS Cohort demographic and Phenotypic data. Demographic and phenotypic information of the cohort analyzed in Fig. 1. (XLSX 14 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 9: RRBS Cohort demographic and Phenotypic data. Demographic and phenotypic information of the cohort analyzed in Fig. 1. (XLSX 14 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 1: RRBS methylation data from colon tumors vs. normal colon samples. Dataset contains methylated CpG sites for 83 colon tumors and 40 normal colon samples. Additionally, gene names with corresponding counts of differential methylation are provided for gene bodies and promoter regions. (XLSX 797 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 1: RRBS methylation data from colon tumors vs. normal colon samples. Dataset contains methylated CpG sites for 83 colon tumors and 40 normal colon samples. Additionally, gene names with corresponding counts of differential methylation are provided for gene bodies and promoter regions. (XLSX 797 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad
Dataset 7: RRBS assay of 83 Colon Tumors and 40 Normal Colon Samples. Complete information on CpG sites analyzed from RRBS assay of Colon Tumors and Normal Colon samples, filtered to sites with at least a 30% change in methylation with all referenced samples having at least 10 reads per site. (XLSX 121505 kb)
Authors
- Saigopal Somasundaram ;
- Forrest, Megan ;
- Moinova, Helen ;
- Cohen, Allison ;
- Varadan, Vinay ;
- LaFramboise, Thomas ;
- Markowitz, Sanford ;
- Khalil, Ahmad