Published on 01 January 2022
Absence of enterotypes in the human gut microbiomes reanalyzed with non-linear dimensionality reduction methods
View DatasetDescription
Here we provide data and software for the work "Absence of enterotypes in the human gut microbiomes reanalyzed with non-linear dimensionality reduction methods".
Enterotypes of the human gut microbiome have been proposed to be a powerful prognostic tool to evaluate the correlation between lifestyle, nutrition, and disease. However, the number of enterotypes suggested in the literature ranged from two to four. The growth of available metagenome data and the use of exact, non-linear methods of data analysis challenges the very concept of clusters in the multidimensional space of bacterial microbiomes.
We demonstrate the presence of a lower-dimensional structure in the microbiome space, with high-dimensional data concentrated near a low-dimensional non-linear submanifold, but the absence of distinct and stable clusters that could represent enterotypes. This observation is robust with regard to diverse combinations of dimensionality reduction techniques and clustering algorithms.
We used 16S rRNA genotype data from the National Institutes of Health Human Microbiome Project (HMP) and American Gut Project (AGP) presented in Order, Family, and Genus taxonomic levels (O, F, and G, respectively). These largest open-access available datasets provide a sufficient number of data points for correct estimation of the clustering partition and constructing a manifold. We used 3457 HMP samples from stool and rectum body sites downloaded from https://portal.hmpdacc.org/ and 9511 samples from AGP downloaded from https://figshare.com/ as abundance matrices.
All datasets were normalized by dividing the Operational Taxonomic Units (OTUs) values by the total sum of abundances for a given data sample.
Citations (0)
No citations found
Mentions (0)
No mentions found
Metrics Over Time
Publication Details
Subfield
Molecular Biology
Field
Biochemistry, Genetics and Molecular Biology
Domain
Life Sciences
Confidence Score
85%
Source
Open Alex