Scholar Data

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

This supplementary dataset provides the list of species included in each analysis and corresponding representative strain genomes used. There are two lists of species: one used for the comparative analysis of virulence factors and pathogenicity, and one used for the case fatality rate comparative analysis. For each set, the resulting annotations are provided in tab-delimited tables format, for both gene-level annotations, and summarised at species-level. These datasets are directly usable to replicate our analysis, using codes provided at https://github.com/CamilleAnna/CooperativePathogenicityVirulence_repo.git. 1.1_pathogen_commensal_genomes_118.txt: list of pathogen and non-pathogen species and corresponding representative strain genome used in comparative analysis of virulence factors and pathogenicity. Fields: species_id = species/strain id in MIDAS database genome_name = corresponding genome name in PATRIC database, genome_id = corresponding genome id in PATRIC database is_rep_genome = is genome representative strain in MIDAS database count_genomes = number of genomes for this species in MIDAS database genus = genus for species species = species name for plotting pathogen = binary variable of whether species was classified as pathogen or non-pathogen gram_profile = gram profile assigned to species (used for specifying gram profile in PSORTb pipeline), with p = gram-positive, n = gram-negative, OM+ = gram-positive with outer membrane. 2.2_assembled_SPECIES_annotation.txt: assembled dataset for species included in comparative analysis of pathogenicity (n = 118 species). Fields 1 to 3: genome used for analysis, with: species_id = species/strain id in MIDAS database genus = genus for species species = species name for plotting. Fields 4 to 13: genome annotations, with gram_profile = species gram staining profile total_cds = proteome size is_victor_vf = number of CDS annotated as virulence factors in VICTOR database pathogen = pathogenic status other fields indicate the number of genes for this species annotated with each of the six forms of cooperation. 2.3_assembled_GENES_annotation.txt: gene-level annotations that were used to assemble the table “2.2_assembled_SPECIES_annotation.txt”. This dataset was used for the comparative analysis of virulence factors (n = 118 species, nrows = 367162 genes) total. Each CDS if flagged “1” if it was annotated with the 6 forms of cooperation, and whether it was recorded as a virulence factor in VICTOR database. “peg” and “product_patric” CDS name and annotation in PATRIC database. 3.2_cfr_SUPFAM_match: list of pathogen and non-pathogen species and corresponding representative strain genome used in comparative analysis of virulence factors and pathogenicity. Fields: Legget_species: species name in Legget et al (2017) supplementary data table matching_supfam_id: corresponding strain code in SCOP database genome_name: corresponding genome name in PATRIC database genome_id: corresponding genome id in PATRIC database file_name_record: internal naming for annotations gram_profile: gram profile assigned to species (used for specifying gram profile in PSORTb pipeline), with p = gram positive, n = gram-negative, OM+ = gram-positive with outer membrane. 4.1_assembled_CFR_SPECIES_annotation.txt: assembled dataset for species included in comparative analysis of case fatality rate (n = 50 species). Fields 1 to 10 are ecological data about species provided in Leggett et al (2017) supplementary tables. Fields 11 to 13: genome used for analysis, with: species_id = internal naming genome_id = corresponding PATRIC genome id matching_supfam_id = corresponding strain code in SCOP database Fields 14 to 22: genome annotations, with: gram_profile = species gram staining profile total_cds = proteome size is_victor_vf = number of CDS annotated as virulence factors in VICTOR database other fields indicate number of genes for this species annotated with each of the six forms of cooperation. 4.2_assembled_CFR_GENES_annotation.txt: gene-level annotations that was used to assemble the table “4.2_assembled_CFR_SPECIES_annotation.txt”, n = 50 species, nrows = 180754 genes total. Each CDS if flagged “1” if it was annotated with the 6 forms of cooperation, and whether it was recorded as a virulence factor in VICTOR database. “peg” and “product_patric” CDS name and annotation in PATRIC database. midas_tree_renamed.newick: phylogeny used for comparative analyses of virulence factors and pathogenicity. We used the MIDAS phylogeny of PATRIC genomes, which we trimmed to our focus species (N = 118) and ultrametricised using chronopl function in ape package. Supfam_cfr_tree.newick: phylogeny used for comparative analyses of virulence factors and pathogenicity. We used the SCOP database generated phylogeny which we ultrametricised using chronopl function in ape package.

Authors

, Camille

0 Citations0 Mentions73% FAIR1.6 Dataset Index

10.5281/zenodo.50897772021

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

Authors

, Camille

0 Citations0 Mentions73% FAIR1.6 Dataset Index

10.5281/zenodo.50897762021

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 1)

Dataset S1 contains all raw and processed material referred to in the published article "Kin selection explains the evolution of cooperation in the gut microbiota". R codes files provide all codes to replicate the analysis. Please refer to the README file for a description of all code files. We also provide access to these data and codes at our GitHub (https://github.com/CamilleAnna/HamiltonRuleMicrobiome gitRepos.git) which can be cloned to directly re-run this analysis. Legends for Dataset S1: Sheet 1: Metagenomic samples used and access links. Sheet 2: Reference on bacterial cooperation retrieved from Web of Science search: TI¯((microb* OR bacter* OR microorganis* OR micro-organis*) AND (coop* OR social*) Sheet 3: Retained bacteria cooperation keywords Sheet 4: GOs identified by annotating all MIDAS database genomes (5944 genomes) with PANNZER2. Sheet 5: Full list of potential bacterial cooperation GO terms and description of manual curation decisions. Sheet 6: Final list of bacterial cooperation GO used for the analysis Sheet 7: Genomic diversity of the bacterial population within and across host. Computed from MIDAS snp_diversity.py pipeline. Sheet 8: final dataset for statistical analysis. Sheet 9: per-gene annotation of cooperation.

Authors

Simonet, Camille ;
McNally, Luke

0 Citations0 Mentions73% FAIR0.8 Dataset Index

10.5281/zenodo.44548202021

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Dataset S1 contains all raw and processed material referred to in the published article "Kin selection explains the evolution of cooperation in the gut microbiota". R codes files provide all codes to replicate the analysis. Please refer to the README file for a description of all code files. The manifest files are those obtained by accessing the HMP portal on April 2020 under Project > HMP, Body Site > feces, Studies>WGS-PP1, File Type > WGS raw sequences set, File format > FASTQ. We also provide access to these data and codes at our GitHub (https://github.com/CamilleAnna/HamiltonRuleMicrobiome gitRepos.git) which can be cloned to directly re-run this analysis. Legends for Dataset S1: Sheet 1: Metagenomic samples used and access links. Sheet 2: Reference on bacterial cooperation retrieved from Web of Science search: TI¯((microb* OR bacter* OR microorganis* OR micro-organis*) AND (coop* OR social*) Sheet 3: Retained bacteria cooperation keywords Sheet 4: GOs identified by annotating all MIDAS database genomes (5944 genomes) with PANNZER2. Sheet 5: Full list of potential bacterial cooperation GO terms and description of manual curation decisions. Sheet 6: Final list of bacterial cooperation GO used for the analysis Sheet 7: Genomic diversity of the bacterial population within and across host. Computed from MIDAS snp_diversity.py pipeline. Sheet 8: final dataset for statistical analysis. Sheet 9: per-gene annotation of cooperation.

Authors

Simonet, Camille ;
McNally, Luke

0 Citations0 Mentions79% FAIR0.1 Dataset Index

10.5281/zenodo.44548192021

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Dataset S1 contains all raw and processed material referred to in the published article "Kin selection explains the evolution of cooperation in the gut microbiota". R codes files provide all codes to replicate the analysis. Please refer to the README file for a description of all code files. The manifest files are those obtained by accessing the HMP portal on April 2020 under Project > HMP, Body Site > feces, Studies>WGS-PP1, File Type > WGS raw sequences set, File format > FASTQ. We also provide access to these data and codes at our GitHub (https://github.com/CamilleAnna/HamiltonRuleMicrobiome gitRepos.git) which can be cloned to directly re-run this analysis. Legends for Dataset S1: Sheet 1: Metagenomic samples used and access links. Sheet 2: Reference on bacterial cooperation retrieved from Web of Science search: TI¯((microb* OR bacter* OR microorganis* OR micro-organis*) AND (coop* OR social*) Sheet 3: Retained bacteria cooperation keywords Sheet 4: GOs identified by annotating all MIDAS database genomes (5944 genomes) with PANNZER2. Sheet 5: Full list of potential bacterial cooperation GO terms and description of manual curation decisions. Sheet 6: Final list of bacterial cooperation GO used for the analysis Sheet 7: Genomic diversity of the bacterial population within and across host. Computed from MIDAS snp_diversity.py pipeline. Sheet 8: final dataset for statistical analysis. Sheet 9: per-gene annotation of cooperation.

Authors

Simonet, Camille ;
McNally, Luke

1 Citation0 Mentions77% FAIR1.2 Dataset Index

10.5281/zenodo.44548672021

Automated Author Profile
Simonet, Camille
University of Edinburgh
0000-0003-1812-1794

Simonet, Camille

Current S-Index

Average Dataset Index per Dataset

Total Datasets

Average FAIR Score

Total Citations

Total Mentions

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 1)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Automated Author ProfileSimonet, CamilleUniversity of Edinburgh0000-0003-1812-1794

Simonet, Camille

Current S-Index

Average Dataset Index per Dataset

Total Datasets

Average FAIR Score

Total Citations

Total Mentions

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

Supplementary dataset for The sociality of pathogenicity and virulence throughout Human pathogen diversity (Version: 1)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet &amp; McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 1)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet &amp; McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet &amp; McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Automated Author Profile
Simonet, Camille
University of Edinburgh
0000-0003-1812-1794

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 1)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)

Kin selection explains the evolution of cooperation in the gut microbiota, by Simonet & McNally, 2020, Dataset S1 and codes for statistical analysis and figures production (Version: 2)