Published on 24 February 2020 |

Version 0.2

Host-adaptation in Legionellales is 2.4 Gya, coincident with eukaryogenesis

View Dataset
Guy, Lionel;Ammunet, Tea;Hugoson, Eric

Description

This dataset contains genomes, proteomes and protein alignments mentioned in Hugoson et al (2019). It has been used to analyze the evolution of host-adaptation in the order Legionellales. The data is organized by dataset type, and then by dataset. The two datasets used here are Gamma105, comprising 105 Gammaproteobacteria and 5 outgroups, and Legio93, comprising 93 Legionellales and 20 outgroups. 1_genomes
Genomes as downloaded or assembled 1_1_Gamma105 1_2_Legio93 2_proteomes
Proteomes, as annotated by prokka 2_1_Gamma105 2_2_Legion93 3_alignments In each folder, the following files are found. All sequence and alignment files are in fasta format: *_concatenated.fasta: concatenated alignment, trimmed. *.map: map of the files, tab-separated. The first row is a title row. The three first columns give the organism, the marker and the id (as found in the fasta file) for the protein. unaligned: non-aligned sequences for each marker. *_aligned: aligned sequences, for each marker. The prefix gives the software used for the alignment. *_trimmed: aligned, trimmed sequences for each marker. The prefix gives the software used to trim the alignment. Folders: 3_1_Gamma105: Based on the Bact109 set of marker, used in Figure 4 and Supplementary Figures 2 and 7 3_2_Legio93, Based on the Bact109 set of marker, used in Figure 1 and Supplementary Figures 1 and 8 3_3_TB4SS_auto: Alignment of 12 genes of the T4BSS, automatically detected in all genomes. Used for the tree in Supplementary Figure 5. 3_4_TB4SS_manual: Alignment of 25 genes of the T4BSS, manually curated by collinearity analysis. Used for the tree in Supplementary Figure 5.

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.9

FAIR Score

77%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Endocrinology

Field

Biochemistry, Genetics and Molecular Biology

Domain

Life Sciences

Confidence Score

100%

Source

Open Alex

Keywords

Molecular EvolutionPhylogenomics

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00