4.8 Article

Predicting growth rate from gene expression

Publisher

NATL ACAD SCIENCES
DOI: 10.1073/pnas.1808080116

Keywords

biological networks; machine learning; data science; systems biology; metabolic networks

Funding

  1. NIH/National Institute of General Medical Sciences (NIGMS) [R01GM113238]
  2. NIH/National Cancer Institute [1U54CA193419]
  3. NSF-Graduate Research Fellowship Program Fund [DGE-0824162]
  4. NIH/NIGMS [5T32GM008382]

Ask authors/readers for more resources

Growth rate is one of the most important and most complex phenotypic characteristics of unicellular microorganisms, which determines the genetic mutations that dominate at the population level, and ultimately whether the population will survive. Translating changes at the genetic level to their growth-rate consequences remains a subject of intense interest, since such a mapping could rationally direct experiments to optimize antibiotic efficacy or bioreactor productivity. In this work, we directly map transcriptional profiles to growth rates by gathering published gene-expression data from Escherichia coli and Saccharomyces cerevisiae with corresponding growth-rate measurements. Using a machine-learning technique called k-nearest-neighbors regression, we build a model which predicts growth rate from gene expression. By exploiting the correlated nature of gene expression and sparsifying the model, we capture 81% of the variance in growth rate of the E. coli dataset, while reducing the number of features from >4,000 to 9. In S. cerevisiae, we account for 89% of the variance in growth rate, while reducing from >5,500 dimensions to 18. Such a model provides a basis for selecting successful strategies from among the combinatorial number of experimental possibilities when attempting to optimize complex phenotypic traits like growth rate.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available