4.5 Article

Assessing computational predictions of the phenotypic effect of cystathionine-beta-synthase variants

Journal

HUMAN MUTATION
Volume 40, Issue 9, Pages 1530-1545

Publisher

WILEY
DOI: 10.1002/humu.23868

Keywords

CAGI challenge; critical assessment; cystathionine-beta-synthase; machine learning; phenotype prediction; single amino acid substitution

Funding

  1. National Institutes of Health [R01 GM120374, R01 GM079656, R01 LM009722, R01 GM071749, R13 HG006650, R01 GM114434, R01 GM066099, U01 GM115486, U41 HG007346]
  2. Fondazione Istituto di Ricerca Pediatrica - Citta della Speranza [18-04]
  3. Estonian Research Council [IUT34-12]
  4. Marie Curie International Outgoing Fellowship [PIOF-GA2009-237751]
  5. Ministry of Education, Universities and Research (MIUR)
  6. Italian Ministry of Health [GR-201102347754]
  7. Interuniversity Attraction Pole (IAP) Network from Science Policy of the Federal Government of Belgium (Belspo) [P7/16]
  8. National Science Foundation [ABI 1458359]
  9. University of Leuven (KUL)
  10. Flanders Institute for Biotechnology (VIB)
  11. Flanders Institute for Science and Technology (IWT)
  12. Funds for Scientific Research Flanders (FWO) [G.0509.13]

Ask authors/readers for more resources

Accurate prediction of the impact of genomic variation on phenotype is a major goal of computational biology and an important contributor to personalized medicine. Computational predictions can lead to a better understanding of the mechanisms underlying genetic diseases, including cancer, but their adoption requires thorough and unbiased assessment. Cystathionine-beta-synthase (CBS) is an enzyme that catalyzes the first step of the transsulfuration pathway, from homocysteine to cystathionine, and in which variations are associated with human hyperhomocysteinemia and homocystinuria. We have created a computational challenge under the CAGI framework to evaluate how well different methods can predict the phenotypic effect(s) of CBS single amino acid substitutions using a blinded experimental data set. CAGI participants were asked to predict yeast growth based on the identity of the mutations. The performance of the methods was evaluated using several metrics. The CBS challenge highlighted the difficulty of predicting the phenotype of an ex vivo system in a model organism when classification models were trained on human disease data. We also discuss the variations in difficulty of prediction for known benign and deleterious variants, as well as identify methodological and experimental constraints with lessons to be learned for future challenges.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available