Published on 01 January 2020

Pan-genome of 10,000 E. coli isolates

Horesh, Gal; Heinz, Eva; Thomson, Nick

Description

The complete metadata of 10,146 high-quality E. coli genomes isolated from human hosts (F1). Description and complete profiling of 50 E. coli lineages, which represent the majority of publicly available human-isolated E. coli genomes (F2). Phylogenetic trees presented in the manuscript (with 500 genomes and with 50 genomes). The complete pan-genome of the collection, which includes:

A FASTA file containing the representative sequence of each gene in the gene pool (F3).
Complete gene presence-absence across all isolates (F4).
The frequency of each gene within each of the lineages (F5).
Representative sequences from each lineage for the final set of genes in the gene pool (i.e. a representative sequence from each lineage) (F6).
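As an illustration of how a per-lineage gene frequency (as in F5) relates to a presence-absence table (as in F4) and a lineage assignment per isolate, here is a minimal sketch. It assumes the frequency is the fraction of isolates in a lineage that carry the gene. The function name, the in-memory layout and the toy isolate and gene names are hypothetical and do not reflect the actual file formats of this dataset.

```python
from collections import defaultdict


def gene_frequency_by_lineage(presence, lineage_of):
    """Return {lineage: {gene: fraction of the lineage's isolates carrying it}}.

    presence   -- dict mapping isolate ID to the set of genes present (hypothetical layout)
    lineage_of -- dict mapping isolate ID to its lineage label (hypothetical layout)
    """
    counts = defaultdict(lambda: defaultdict(int))  # lineage -> gene -> carrier count
    sizes = defaultdict(int)                        # lineage -> number of isolates
    for isolate, genes in presence.items():
        lineage = lineage_of[isolate]
        sizes[lineage] += 1
        for gene in genes:
            counts[lineage][gene] += 1
    return {
        lineage: {gene: n / sizes[lineage] for gene, n in counts[lineage].items()}
        for lineage in sizes
    }


# Toy data, invented purely for illustration.
presence = {"iso1": {"geneA", "geneB"}, "iso2": {"geneA"}, "iso3": {"geneB"}}
lineage_of = {"iso1": 1, "iso2": 1, "iso3": 2}
freq = gene_frequency_by_lineage(presence, lineage_of)
```

In this toy case, geneA is carried by both isolates of lineage 1 and geneB by one of the two. Genes absent from a lineage are simply left out of that lineage's dictionary rather than stored as zero.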

Metrics

Dataset Index

0.3

FAIR Score

13%

Citations

0

Mentions

0

Publication Details

DOI

Publisher

figshare

Assigned Domain

Subfield

Genetics

Field

Biochemistry, Genetics and Molecular Biology

Domain

Life Sciences

Confidence Score

89%

Source

OpenAlex

Keywords

Computational Biology; 60503 Microbial Genetics; FOS: Biological sciences; Microbiology

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00