Automated Author Profile

Lamberton, Poppy

0000-0003-1048-6318

Current S-Index

11.1

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

1.1

Average Dataset Index per dataset

Total Datasets

10

Total datasets for this author

Average FAIR Score

66.7%

Average FAIR Score per dataset

Total Citations

3

Total citations to the author's datasets

Total Mentions

2

Total mentions of the author's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Variation in water contact behaviour and risk of Schistosoma mansoni (re)infection among Ugandan school-aged children in an area with persistent high endemicity.

Snail species, abundance and physicochemical water factors by contact site and season.

Authors

  • Trienekens, Suzan ;
  • Faust, Christina ;
  • Besigye, Fred ;
  • Pickering, Lucy ;
  • Tukahebwa, Edridah ;
  • Seeley, Janet ;
  • Lamberton, Poppy
0 Citations0 Mentions73% FAIR1.6 Dataset Index
10.5525/gla.researchdata.1233January 2022

Data release: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: DOI:10.5281/zenodo.4975909. (I'm happy to answer any questions just email [email protected], manuscript that goes with these data is about to be resubmitted) File descriptions:
prunedData.eigenvec - Figure 2a&b - PCA eigenvectors prunedData.eigenval - Figure 2a&b- PCA eigenvalues autosomes.mdist - Figure 2c - PLINK distance matrix used to build the neighbour joining phylogeny all.pi.pixy.schools.txt - Figure 2d - Nucleotide diversity results for each school subpopulation autosomes.dxy.5kb.schools.txt - Figure 2e - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Figure 2e - Autosomal FST results between school subpopulations. admixture_all.txt - Figure 2f - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. sfs.csv - Figure 3a, Supplementary figure 10a - Site frequency spectra (allelic proportions at each frequency bin) for each school. TD.all.txt - Figure 3b - Tajima's D values calculated in 5 kb windows for each school subpopulation. ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Figure 4a - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - Figure 4b - Normalised iHS scores for the Tororo district parasite populations (Selscan output). ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - Figure 4c - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. MAYUGE_TORORO_2000.windowed.weir.txt.zip - Figure 4d - FST values calculated between Mayuge and Tororo populations in 2kb windows. MAYUGE_PI.windowed.pi.zip - Figure 4e, Supplementary figure 12a&c - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations.
TORORO_PI.windowed.pi.zip - Figure 4e, Supplementary figure 12a&c- Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). all.pi.treat.fix.txt.zip - Figure 5a - Nucleotide diversity results for each treatment subpopulation autosomes.dxy.5kb.treatment.txt - Figure 5b - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Figure 5b - Autosomal FST results between clearance phenotype subpopulations. fst.windows.2kb.treatment.txt.zip - Figure 5c - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) assoc_err_binary.txt.zip - Figure 5d - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. assoc_err_linear.txt.zip - Figure 5e - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. median.coverage.txt - Supplementary figure 1 - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1).
cohort.genotyped.txt.zip - Supplementary figure 2(a-f) - Variant quality site values (used to inform variant site retention or removal).
hard_filtered.imiss.txt - Supplementary figure 2g - Per sample variant missingness (used to inform quality control)
hard_filtered_filtindv.lmiss.txt.zip - Supplementary figure 2h - Per site missingness (used to inform quality control) cv_scores.txt - Supplementary figure 5 - ADMIXTURE CV scores (column 2) for each population size (1). *_SMC_SE.csv - Supplementary figure 6 -SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. smcpp.csv - Supplementary figure 7 - SMC++ results for each school subpopulation and outgroup samples.
pi.per_host.txt.zip - Supplementary figure 8a-d - Nucleotide diversity values for each host infrapopulation.
sexing.csv - Supplementary figure 9- inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Supplementary figure 11 - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Supplementary figure 11 - Tajima's D values calculated for the Tororo population in 2kb windows. genes.bed - Supplementary figures 13-18 - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Supplementary figures 13-18 - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Supplementary figures 13a-18a - Per site FST values between Mayuge and Tororo populations. kocoge_median.ld.txt.zip - Supplementary figure 19 - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - Supplementary figure 19 - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. schools.list - List of samples and schools where they were sampled. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1).

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
0 Citations0 Mentions13% FAIR0.1 Dataset Index
10.5281/zenodo.4940589June 2021

Data release: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: 'Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration'. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: http://doi.org/10.5281/zenodo.4975908.
File descriptions: SOURCE DATA.zip - All source data for all figures. Figure 1b: supplementary_data_9.txt - Metadata Figure 2a&b: 207_PCA.eigenvec - PCA eigenvectors 207_PCA.eigenval - PCA eigenvalues Figure 2c: autosomes.mdist - PLINK distance matrix used to build the neighbour joining phylogeny Figure 2d: all.pi.pixy.schools.txt - Nucleotide diversity results for each school subpopulation. Figure 2e: autosomes.dxy.5kb.schools.txt - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Autosomal FST results between school subpopulations. Figure 2f: admixture_all.txt - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. Figure 3a, Supplementary figure 10a: sfs.csv - Site frequency spectra (allelic proportions at each frequency bin) for each school. Figure 3b: TD.all.txt - Tajima's D values calculated in 5 kb windows for each school subpopulation. Figure 4a, Supplementary figures 13-18: ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). Figure 4b, Supplementary figures 13-18: ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - - Normalised iHS scores for the Tororo district parasite populations (Selscan output). Figure 4c, Supplementary figures 13-18: ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. Figure 4d, Supplementary figures 13-18: MAYUGE_TORORO_2000.windowed.weir.txt.zip - FST values calculated between Mayuge and Tororo populations in 2kb windows. Figure 4e, Supplementary figures 12a&c: MAYUGE_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations. TORORO_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). Figure 5a: all.pi.treat.fix.txt.zip - Nucleotide diversity results for each treatment subpopulation Figure 5b autosomes.dxy.5kb.treatment.txt - - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Autosomal FST results between clearance phenotype subpopulations. Figure 5c: fst.windows.2kb.treatment.txt.zip - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) Figure 5d: assoc_err_binary.txt.zip - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. Figure 5e: assoc_err_linear.txt.zip - - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. Supplementary figure 1: median.coverage.txt - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1). Supplementary figure 2a-f: cohort.genotyped.txt.zip - - Variant quality site values (used to inform variant site retention or removal). Supplementary figure 2g: hard_filtered.imiss.txt - Per sample variant missingness (used to inform quality control). Supplementary figure 2h: hard_filtered_filtindv.lmiss.txt.zip - Per site missingness (used to inform quality control). Supplementary figure 3a, 4a, 4b: prunedData.eigenvec - PCA eigenvectors prunedData.eigenval - PCA eigenvalues Supplementary figure 3b: pruned_data.mdist.csv - Distance matrix used as the basis for the neighbour joining phylogeny. Supplementary figure 5: cv_scores.txt - ADMIXTURE coefficient of variation scores (column 2) for each population size (1). Supplementary figure 6: *_SMC_SE.csv - SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. Supplementary Figure 7: smcpp.csv - SMC++ results for each school subpopulation and outgroup samples. Supplementary Figure 8a-d pi.per_host.txt.zip - Nucleotide diversity values for each host infrapopulation. Supplementary Figure 9: sexing.csv - inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). Supplementary Figure 10b: sfs_res.csv - residuals for the SFS analysis in 3a/10a. Supplementary Figure 11: MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Tororo population in 2kb windows. Supplementary Figures 13-18: genes.bed - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Per site FST values between Mayuge and Tororo populations. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1). median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. Supplementary Figure 19: kocoge_median.ld.txt.zip - - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. Misc files: schools.list - List of samples and schools where they were sampled.

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
1 Citation0 Mentions54% FAIR0.9 Dataset Index
10.5281/zenodo.4940588June 2021

Data release: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: http://doi.org/10.5281/zenodo.4975908. (I'm happy to answer any questions just email [email protected])
File descriptions: Figure 2a&b: 207_PCA.eigenvec - PCA eigenvectors 207_PCA.eigenval - PCA eigenvalues Figure 2c: autosomes.mdist - PLINK distance matrix used to build the neighbour joining phylogeny Figure 2d: all.pi.pixy.schools.txt - Nucleotide diversity results for each school subpopulation. Figure 2e: autosomes.dxy.5kb.schools.txt - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Autosomal FST results between school subpopulations. Figure 2f: admixture_all.txt - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. Figure 3a, Supplementary figure 10a: sfs.csv - Site frequency spectra (allelic proportions at each frequency bin) for each school. Figure 3b: TD.all.txt - Tajima's D values calculated in 5 kb windows for each school subpopulation. Figure 4a, Supplementary figures 13-18: ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). Figure 4b, Supplementary figures 13-18: ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - - Normalised iHS scores for the Tororo district parasite populations (Selscan output). Figure 4c, Supplementary figures 13-18: ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. Figure 4d, Supplementary figures 13-18: MAYUGE_TORORO_2000.windowed.weir.txt.zip - FST values calculated between Mayuge and Tororo populations in 2kb windows. Figure 4e, Supplementary figures 12a&c: MAYUGE_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations. TORORO_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). Figure 5a: all.pi.treat.fix.txt.zip - Nucleotide diversity results for each treatment subpopulation Figure 5b autosomes.dxy.5kb.treatment.txt - - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Autosomal FST results between clearance phenotype subpopulations. Figure 5c: fst.windows.2kb.treatment.txt.zip - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) Figure 5d: assoc_err_binary.txt.zip - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. Figure 5e: assoc_err_linear.txt.zip - - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. Supplementary figure 1: median.coverage.txt - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1). Supplementary figure 2a-f: cohort.genotyped.txt.zip - - Variant quality site values (used to inform variant site retention or removal). Supplementary figure 2g: hard_filtered.imiss.txt - Per sample variant missingness (used to inform quality control). Supplementary figure 2h: hard_filtered_filtindv.lmiss.txt.zip - Per site missingness (used to inform quality control). Supplementary figure 3a, 4a, 4b: prunedData.eigenvec - PCA eigenvectors prunedData.eigenval - PCA eigenvalues Supplementary figure 3b: pruned_data.mdist.csv - Distance matrix used as the basis for the neighbour joining phylogeny. Supplementary figure 5: cv_scores.txt - ADMIXTURE coefficient of variation scores (column 2) for each population size (1). Supplementary figure 6: *_SMC_SE.csv - SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. Supplementary figure 7: smcpp.csv - SMC++ results for each school subpopulation and outgroup samples. Supplementary figure 8a-d pi.per_host.txt.zip - Nucleotide diversity values for each host infrapopulation. Supplementary figure 9: sexing.csv - inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). Supplementary figure 11: MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Tororo population in 2kb windows. Supplementary figures 13-18: genes.bed - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Per site FST values between Mayuge and Tororo populations. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1). median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. Supplementary figure 19: kocoge_median.ld.txt.zip - - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. Misc files: schools.list - List of samples and schools where they were sampled.

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
0 Citations0 Mentions77% FAIR0.8 Dataset Index
10.5281/zenodo.4982893June 2021

Data release: Population genomics of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: 'Population genomics of Schistosoma mansoni reveals extensive diversity with limited selection despite repeated mass drug administration'. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: http://doi.org/10.5281/zenodo.4975908. (I'm happy to answer any questions just email [email protected])
File descriptions: Figure 1b: supplementary_data_9.txt - Metadata Figure 2a&b: 207_PCA.eigenvec - PCA eigenvectors 207_PCA.eigenval - PCA eigenvalues Figure 2c: autosomes.mdist - PLINK distance matrix used to build the neighbour joining phylogeny Figure 2d: all.pi.pixy.schools.txt - Nucleotide diversity results for each school subpopulation. Figure 2e: autosomes.dxy.5kb.schools.txt - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Autosomal FST results between school subpopulations. Figure 2f: admixture_all.txt - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. Figure 3a, Supplementary figure 10a: sfs.csv - Site frequency spectra (allelic proportions at each frequency bin) for each school. Figure 3b: TD.all.txt - Tajima's D values calculated in 5 kb windows for each school subpopulation. Figure 4a, Supplementary figures 13-18: ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). Figure 4b, Supplementary figures 13-18: ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - - Normalised iHS scores for the Tororo district parasite populations (Selscan output). Figure 4c, Supplementary figures 13-18: ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. Figure 4d, Supplementary figures 13-18: MAYUGE_TORORO_2000.windowed.weir.txt.zip - FST values calculated between Mayuge and Tororo populations in 2kb windows. Figure 4e, Supplementary figures 12a&c: MAYUGE_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations. TORORO_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). Figure 5a: all.pi.treat.fix.txt.zip - Nucleotide diversity results for each treatment subpopulation Figure 5b autosomes.dxy.5kb.treatment.txt - - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Autosomal FST results between clearance phenotype subpopulations. Figure 5c: fst.windows.2kb.treatment.txt.zip - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) Figure 5d: assoc_err_binary.txt.zip - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. Figure 5e: assoc_err_linear.txt.zip - - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. Supplementary figure 1: median.coverage.txt - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1). Supplementary figure 2a-f: cohort.genotyped.txt.zip - - Variant quality site values (used to inform variant site retention or removal). Supplementary figure 2g: hard_filtered.imiss.txt - Per sample variant missingness (used to inform quality control). Supplementary figure 2h: hard_filtered_filtindv.lmiss.txt.zip - Per site missingness (used to inform quality control). Supplementary figure 3a, 4a, 4b: prunedData.eigenvec - PCA eigenvectors prunedData.eigenval - PCA eigenvalues Supplementary figure 3b: pruned_data.mdist.csv - Distance matrix used as the basis for the neighbour joining phylogeny. Supplementary figure 5: cv_scores.txt - ADMIXTURE coefficient of variation scores (column 2) for each population size (1). Supplementary figure 6: *_SMC_SE.csv - SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. Supplementary Figure 7: smcpp.csv - SMC++ results for each school subpopulation and outgroup samples. Supplementary Figure 8a-d pi.per_host.txt.zip - Nucleotide diversity values for each host infrapopulation. Supplementary Figure 9: sexing.csv - inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). Supplementary Figure 10b: sfs_res.csv - residuals for the SFS analysis in 3a/10a. Supplementary Figure 11: MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Tororo population in 2kb windows. Supplementary Figures 13-18: genes.bed - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Per site FST values between Mayuge and Tororo populations. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1). median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. Supplementary Figure 19: kocoge_median.ld.txt.zip - - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. Misc files: schools.list - List of samples and schools where they were sampled.

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
0 Citations2 Mentions77% FAIR1.8 Dataset Index
10.5281/zenodo.5013683June 2021

Data release: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: 'Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration'. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: http://doi.org/10.5281/zenodo.4975908. (I'm happy to answer any questions just email [email protected])
File descriptions: source_data.xlsx - All source data for all figures. Figure 1b: supplementary_data_9.txt - Metadata Figure 2a&b: 207_PCA.eigenvec - PCA eigenvectors 207_PCA.eigenval - PCA eigenvalues Figure 2c: autosomes.mdist - PLINK distance matrix used to build the neighbour joining phylogeny Figure 2d: all.pi.pixy.schools.txt - Nucleotide diversity results for each school subpopulation. Figure 2e: autosomes.dxy.5kb.schools.txt - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Autosomal FST results between school subpopulations. Figure 2f: admixture_all.txt - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. Figure 3a, Supplementary figure 10a: sfs.csv - Site frequency spectra (allelic proportions at each frequency bin) for each school. Figure 3b: TD.all.txt - Tajima's D values calculated in 5 kb windows for each school subpopulation. Figure 4a, Supplementary figures 13-18: ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). Figure 4b, Supplementary figures 13-18: ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - - Normalised iHS scores for the Tororo district parasite populations (Selscan output). Figure 4c, Supplementary figures 13-18: ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. Figure 4d, Supplementary figures 13-18: MAYUGE_TORORO_2000.windowed.weir.txt.zip - FST values calculated between Mayuge and Tororo populations in 2kb windows. Figure 4e, Supplementary figures 12a&c: MAYUGE_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations. TORORO_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). Figure 5a: all.pi.treat.fix.txt.zip - Nucleotide diversity results for each treatment subpopulation Figure 5b autosomes.dxy.5kb.treatment.txt - - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Autosomal FST results between clearance phenotype subpopulations. Figure 5c: fst.windows.2kb.treatment.txt.zip - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) Figure 5d: assoc_err_binary.txt.zip - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. Figure 5e: assoc_err_linear.txt.zip - - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. Supplementary figure 1: median.coverage.txt - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1). Supplementary figure 2a-f: cohort.genotyped.txt.zip - - Variant quality site values (used to inform variant site retention or removal). Supplementary figure 2g: hard_filtered.imiss.txt - Per sample variant missingness (used to inform quality control). Supplementary figure 2h: hard_filtered_filtindv.lmiss.txt.zip - Per site missingness (used to inform quality control). Supplementary figure 3a, 4a, 4b: prunedData.eigenvec - PCA eigenvectors prunedData.eigenval - PCA eigenvalues Supplementary figure 3b: pruned_data.mdist.csv - Distance matrix used as the basis for the neighbour joining phylogeny. Supplementary figure 5: cv_scores.txt - ADMIXTURE coefficient of variation scores (column 2) for each population size (1). Supplementary figure 6: *_SMC_SE.csv - SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. Supplementary Figure 7: smcpp.csv - SMC++ results for each school subpopulation and outgroup samples. Supplementary Figure 8a-d pi.per_host.txt.zip - Nucleotide diversity values for each host infrapopulation. Supplementary Figure 9: sexing.csv - inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). Supplementary Figure 10b: sfs_res.csv - residuals for the SFS analysis in 3a/10a. Supplementary Figure 11: MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Tororo population in 2kb windows. Supplementary Figures 13-18: genes.bed - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Per site FST values between Mayuge and Tororo populations. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1). median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. Supplementary Figure 19: kocoge_median.ld.txt.zip - - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. Misc files: schools.list - List of samples and schools where they were sampled.

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
0 Citations0 Mentions77% FAIR0.8 Dataset Index
10.5281/zenodo.5045162June 2021

Data release: Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration

Source data used in the publication: Berger et al. (2021) - Provisional title: 'Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration'. These data were used to generate all figures used in the publication and all files are organised and labelled specifically to run with the custom code that uses these data can be found at: http://doi.org/10.5281/zenodo.4975908.
File descriptions: SOURCE DATA.zip - All source data for all figures. Figure 1b: supplementary_data_9.txt - Metadata Figure 2a&b: 207_PCA.eigenvec - PCA eigenvectors 207_PCA.eigenval - PCA eigenvalues Figure 2c: autosomes.mdist - PLINK distance matrix used to build the neighbour joining phylogeny Figure 2d: all.pi.pixy.schools.txt - Nucleotide diversity results for each school subpopulation. Figure 2e: autosomes.dxy.5kb.schools.txt - Autosomal DXY results between school subpopulations. autosomes.fst.5kb.schools.txt - Autosomal FST results between school subpopulations. Figure 2f: admixture_all.txt - ADMIXTURE results for each sample and population sizes, column 1 represents number of populations (K), columns 3-8 represent admixture values for each population. Figure 3a, Supplementary figure 10a: sfs.csv - Site frequency spectra (allelic proportions at each frequency bin) for each school. Figure 3b: TD.all.txt - Tajima's D values calculated in 5 kb windows for each school subpopulation. Figure 4a, Supplementary figures 13-18: ALL.MAYUGE.IHS.ihs.out.100bins.norm.txt.zip - Normalised iHS scores for the Mayuge district parasite populations (Selscan output). Figure 4b, Supplementary figures 13-18: ALL.TORORO.IHS.ihs.out.100bins.norm.txt.zip - - Normalised iHS scores for the Tororo district parasite populations (Selscan output). Figure 4c, Supplementary figures 13-18: ALL.MAYUGEvsTORORO.xpehh.xpehh.out.norm.txt.zip - - Normalised XP-EHH scores between Mayuge and Tororo parasite populations. Figure 4d, Supplementary figures 13-18: MAYUGE_TORORO_2000.windowed.weir.txt.zip - FST values calculated between Mayuge and Tororo populations in 2kb windows. Figure 4e, Supplementary figures 12a&c: MAYUGE_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Mayuge populations. TORORO_PI.windowed.pi.zip - Nucleotide diversity values calculated in 2 kb windows for Kocoge populations (Tororo district). Figure 5a: all.pi.treat.fix.txt.zip - Nucleotide diversity results for each treatment subpopulation Figure 5b autosomes.dxy.5kb.treatment.txt - - Autosomal DXY results between clearance phenotype subpopulations. autosomes.fst.5kb.treatment.txt - Autosomal FST results between clearance phenotype subpopulations. Figure 5c: fst.windows.2kb.treatment.txt.zip - FST values for comparisons between different treatment groups (Pre-treatment, post-treatment (good clearers), post-treatment (poor clearers)) Figure 5d: assoc_err_binary.txt.zip - Results of binary trait association between miracidia sampled from hosts with good clearance phenotypes (where treatment appeared to be highly effective) and miracidia isolated post-treatment from hosts with poor clearance phenotypes (where miracidia are potentially derived from parasites that survived treatment. Figure 5e: assoc_err_linear.txt.zip - - Results of linear regression genome-wide association study with the ERR estimates for all 198 samples, using the mean of the posterior ERR estimates from Crellen et al. (2016) as a quantitative trait. Supplementary figure 1: median.coverage.txt - Normalised depth of read coverage (column 4) calculated in 25 kb windows (columns 2&3) across all samples for all chromosomes (column 1). Supplementary figure 2a-f: cohort.genotyped.txt.zip - - Variant quality site values (used to inform variant site retention or removal). Supplementary figure 2g: hard_filtered.imiss.txt - Per sample variant missingness (used to inform quality control). Supplementary figure 2h: hard_filtered_filtindv.lmiss.txt.zip - Per site missingness (used to inform quality control). Supplementary figure 3a, 4a, 4b: prunedData.eigenvec - PCA eigenvectors prunedData.eigenval - PCA eigenvalues Supplementary figure 3b: pruned_data.mdist.csv - Distance matrix used as the basis for the neighbour joining phylogeny. Supplementary figure 5: cv_scores.txt - ADMIXTURE coefficient of variation scores (column 2) for each population size (1). Supplementary figure 6: *_SMC_SE.csv - SMC++ results (from 25 subsampled replicates) for each school subpopulation and outgroup samples. Supplementary Figure 7: smcpp.csv - SMC++ results for each school subpopulation and outgroup samples. Supplementary Figure 8a-d pi.per_host.txt.zip - Nucleotide diversity values for each host infrapopulation. Supplementary Figure 9: sexing.csv - inferred sex (based on differential read coverage over pseudoautosomal and Z-specific regions of the Z chromosome). Supplementary Figure 10b: sfs_res.csv - residuals for the SFS analysis in 3a/10a. Supplementary Figure 11: MAYUGE_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Mayuge population in 2kb windows. Tororo_TAJIMA_D.Tajima.D.2kb.txt.zip - Tajima's D values calculated for the Tororo population in 2kb windows. Supplementary Figures 13-18: genes.bed - Coordinates of gene models (S. mansoni v7 annotation). KOCOGE_SITE_PI.sites.pi.txt.zip - Per site nucleotide diversity values MAYUGE_TORORO_sites.weir.fst.txt.zip - Per site FST values between Mayuge and Tororo populations. coverage_5kb.windows.txt.zip - Per sample depth of read coverage in 5 kb windows. Columns 4,5,6 represent the median, mean and sstev of coverage for each 5kb window (columns 2&3) along each chromosome (column 1). median.sample.coverage.txt - Median chromosomal depth of read coverage for each sample. Supplementary Figure 19: kocoge_median.ld.txt.zip - - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Kocoge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. mayuge_median.ld.txt.zip - The decay of linkage disequilibrium with genomic distance between all sites within 50 kb for the Mayuge parasite samples. Chromosomes are shown in column 1, distance in column 2, median values in column 3. Misc files: schools.list - List of samples and schools where they were sampled.

Authors

  • Berger, Duncan ;
  • Crellen, Thomas ;
  • Lamberton, Poppy H. L. ;
  • Allan, Fiona ;
  • Tracey, Alan ;
  • Noonan, Jennifer D. ;
  • Kabatereine, Narcis B. ;
  • Edridah M. Tukahebwa ;
  • Adriko, Moses ;
  • Holroyd, Nancy ;
  • Webster, Joanne P. ;
  • Berriman, Matthew ;
  • Cotton, James A.
0 Citations0 Mentions77% FAIR0.8 Dataset Index
10.5281/zenodo.5090014June 2021

Anonymised raw Schistosoma mansoni and host ABO blood group data for: Francoeur et al. ABO blood group do not predict Schistosoma mansoni infection profiles in highly endemic villages of Uganda

Raw anonymised Schistosoma mansoni and human host ABO blood group dataset from school-aged children, aged 6-14, from three high endemicity villages in Mayuge District Uganda. Data include age, sex, height, weight, village, date, timepoint pre and post treatment with praziquantel and albendazole, up to three days per timepoint of duplicate Kato-Katz slides with infection intensity data for S. mansoni and soil-transmitted helminth infections, including hookworm, Ascaris lumbricoides, Trichuris trichiura, Hymenolepis nana, and Enterobius vermicularis, recorded, and ABO and Rhesus positive or negative blood group type.

Authors

  • Francoeur, Rachel ;
  • Atuhaire, Alon ;
  • Moses, Arinaitwe ;
  • Adriko, Moses ;
  • Ajambo, Diana ;
  • Nankasi, Andrina ;
  • Babayan, Simon ;
  • Lamberton, Poppy
1 Citation0 Mentions73% FAIR1.9 Dataset Index
10.5525/gla.researchdata.1219January 2021

Impacts of host gender on Schistosoma mansoni risk in rural Uganda—A mixed methods approach

Time to reinfection and microsatellite data for epidemiological and genetic analyses

Authors

  • Trienekens, Suzan ;
  • Faust, Christina ;
  • Meginnis, Keila ;
  • Pickering, Lucy ;
  • Ericsson, Olivia ;
  • Nankasi, Andrina ;
  • Moses, Arinaitwe ;
  • Tukahebwa, Edridah ;
  • Lamberton, Poppy
1 Citation0 Mentions73% FAIR1.1 Dataset Index
10.5525/gla.researchdata.1009January 2020

Two-year longitudinal survey reveals high genetic diversity of Schistosoma mansoni with adult worms surviving praziquantel treatment at the start of mass drug administration in Uganda

Dataset to underpin associated article.

Authors

  • Faust, Christina ;
  • Crotti, Marco ;
  • Moses, A ;
  • Ogutto, D ;
  • Wamboko, A ;
  • Adriko, M ;
  • Adekanle, E ;
  • Kabatereine, N ;
  • Tukahebwa, E ;
  • Norton, A ;
  • Gower, C ;
  • Webster, J ;
  • Lamberton, Poppy
0 Citations0 Mentions73% FAIR1.8 Dataset Index
10.5525/gla.researchdata.931January 2019