4.6 Article

Genetic dissection of complex traits using hierarchical biological knowledge

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 17, Issue 9, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1009373

Keywords

-

Funding

  1. National Institutes of Health [R01 HG009979, P41 GM103504, P50 DA037844]

Ask authors/readers for more resources

Using a machine learning system and hierarchical biological knowledge, researchers found associations between genetic mutations and specific phenotypic traits; although mostly novel loci were identified, they still failed to explain all heritable variations; this approach demonstrates the potential of amplifying and interpreting signals in population genetic studies.
Despite the growing constellation of genetic loci linked to common traits, these loci have yet to account for most heritable variation, and most act through poorly understood mechanisms. Recent machine learning (ML) systems have used hierarchical biological knowledge to associate genetic mutations with phenotypic outcomes, yielding substantial predictive power and mechanistic insight. Here, we use an ontology-guided ML system to map single nucleotide variants (SNVs) focusing on 6 classic phenotypic traits in natural yeast populations. The 29 identified loci are largely novel and account for similar to 17% of the phenotypic variance, versus <3% for standard genetic analysis. Representative results show that sensitivity to hydroxyurea is linked to SNVs in two alternative purine biosynthesis pathways, and that sensitivity to copper arises through failure to detoxify reactive oxygen species in fatty acid metabolism. This work demonstrates a knowledge-based approach to amplifying and interpreting signals in population genetic studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available