4.8 Article

Evolutionarily informed machine learning enhances the power of predictive gene-to-phenotype relationships

Journal

NATURE COMMUNICATIONS
Volume 12, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-021-25893-w

Keywords

-

Funding

  1. National Science Foundation Plant Genome Research Program [IOS-1339362]
  2. USDA National Institute of Food and Agriculture Hatch project [1013620]
  3. USDA-NIFA predoctoral fellowship [2016-67011025167]
  4. NSF CompGen fellowship
  5. Jonathon Baldwin Turner graduate fellowship from the College of Agriculture, Consumer, and Environmental Sciences at the University of Illinois at Urbana-Champaign

Ask authors/readers for more resources

The study uses an evolutionarily informed machine learning approach to predict genes affecting nitrogen utilization in plant model organisms, and demonstrates the applicability of the method in mammalian systems.
Inferring phenotypic outcomes from genomic features is both a promise and challenge for systems biology. Using gene expression data to predict phenotypic outcomes, and functionally validating the genes with predictive powers are two challenges we address in this study. We applied an evolutionarily informed machine learning approach to predict phenotypes based on transcriptome responses shared both within and across species. Specifically, we exploited the phenotypic diversity in nitrogen use efficiency and evolutionarily conserved transcriptome responses to nitrogen treatments across Arabidopsis accessions and maize varieties. We demonstrate that using evolutionarily conserved nitrogen responsive genes is a biologically principled approach to reduce the feature dimensionality in machine learning that ultimately improved the predictive power of our gene-to-trait models. Further, we functionally validated seven candidate transcription factors with predictive power for NUE outcomes in Arabidopsis and one in maize. Moreover, application of our evolutionarily informed pipeline to other species including rice and mice models underscores its potential to uncover genes affecting any physiological or clinical traits of interest across biology, agriculture, or medicine. Predicting complex phenotypes from genomic information is still a challenge. Here, the authors use an evolutionarily informed machine learning approach within and across species to predict genes affecting nitrogen utilization in crops, and show their approach is also useful in mammalian systems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available