Published on 11 February 2025 |

Version V1

Pangenome reconstruction of Escherichia coli metabolism (E. coli panGEM - Full set) + codes and data for pangenome scale knock-out simulation

View Dataset
Ardalani, Omid

Description

*-- Rare Metabolic Gene Essentiality is a Determinant of Microniche Adaptation in Escherichia coli --*This repository contains Jupyter Notebooks and scripts for analyzing Escherichia coli pan-genome-scale metabolic models (panGEMs) to reproduce the results and figures presented in our manuscript. The primary objectives of these analyses are:Validation of GEMs using:Gene knock-out dataBioLog phenotyping dataSimulation of E. coli growth in major human body niches (feces, urine, and serum).Detection of available nutrients in each body site that support E. coli growth.Prediction of gene fitness scores across the panGEM using extensive gene knock-out simulations in feces, urine, serum, and M9 minimal media as a standard reference.Identification of rare essential genes across E. coli panGEM.panGEM Reconstruction & CurationFor reconstruction and curation of panGEMs, please refer to our previously published notebook available at:πŸ”— GitHub: EcopanGEMIf you use the scripts for panGEM reconstruction, please cite the following article:πŸ“„ Preprint: BioRxivPrediction of Fitness Scores Across Feces, Serum, Urine, and M9 MediaTo run the fitness score predictions, execute the following scripts in the terminal.πŸ’‘ Note:The fitness score prediction is computationally intensive.Scripts are optimized for high-performance computing environments (96-core virtual machines).Running these scripts on lower configurations may result in prolonged computation times.Scripts OverviewMedia SimulationπŸ“Œ media.py – Functions for simulating feces, serum, urine, and M9 minimal media.Flux Variability Analysis (FVA)πŸ“Œ fva_ecoli_biosamples.py – FVA across all reactions of all strains in feces, serum, and urine.πŸ“Œ upec_non_upec_feces_fva_comparison.py – FVA comparison between UPEC and non-UPEC strains.πŸ“Œ serum_eco.py – FVA across all GEMs on serum medium.πŸ“Œ urine_eco.py – FVA across all GEMs on urine medium.πŸ“Œ feces_eco.py – FVA across all GEMs on feces medium.Gene Knock-Out Fitness Score SimulationsπŸ“Œ fitness_rare_m9.py – Knock-out simulation on M9 medium.πŸ“Œ fitness_rare_feces.py – Knock-out simulation on feces medium.πŸ“Œ fitness_rare_serum.py – Knock-out simulation on serum medium.πŸ“Œ fitness_rare_urine.py – Knock-out simulation on urine medium.Concatenation of Knock-Out Simulation ResultsπŸ“Œ master_feces_concat.py – Merging results from feces knock-out simulations.πŸ“Œ master_m9_concat.py – Merging results from M9 knock-out simulations.πŸ“Œ master_serum_concat.py – Merging results from serum knock-out simulations.πŸ“Œ master_urine_concat.py – Merging results from urine knock-out simulations.Data AvailabilityAll data generated in this study are available in this repository, allowing users to reproduce the results without running simulations. This ensures easy validation and reanalysis of our findings.

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.7

FAIR Score

69%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Endocrinology

Field

Biochemistry, Genetics and Molecular Biology

Domain

Life Sciences

Confidence Score

50%

Source

Scholar Data Model

Keywords

Pangempangenomepangenome scale metabolic modelE. coli

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00