4.7 Article

Machine Learning Models for Genetic Risk Assessment of Infants with Non-syndromic Orofacial Cleft

期刊

GENOMICS PROTEOMICS & BIOINFORMATICS
卷 16, 期 5, 页码 354-364

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.gpb.2018.07.005

关键词

Orofacial cleft; Genetic risk; Folic acid; Vitamin A; Nutritional intervention

资金

  1. open project of Beijing Advanced Innovation Center for Food Nutrition and Human Health, China
  2. National Natural Science Foundation of China [81860370]
  3. Beijing Municipal Natural Science Foundation [7182184]
  4. interdisciplinary medicine Seed Fund of Peking University, China [BMU2017MB006]
  5. National Postdoctoral Program for Innovative Talents, China [BX201600150]

向作者/读者索取更多资源

The isolated type of orofacial cleft, termed non-syndromic cleft lip with or without cleft palate (NSCL/P), is the second most common birth defect in China, with Asians having the highest incidence in the world. NSCL/P involves multiple genes and complex interactions between genetic and environmental factors, imposing difficulty for the genetic assessment of the unborn fetus carrying multiple NSCL/P-susceptible variants. Although genome-wide association studies (GWAS) have uncovered dozens of single nucleotide polymorphism (SNP) loci in different ethnic populations, the genetic diagnostic effectiveness of these SNPs requires further experimental validation in Chinese populations before a diagnostic panel or a predictive model covering multiple SNPs can be built. In this study, we collected blood samples from control and NSCL/P infants in Han and Uyghur Chinese populations to validate the diagnostic effectiveness of 43 candidate SNPs previously detected using GWAS. We then built predictive models with the validated SNPs using different machine learning algorithms and evaluated their prediction performance. Our results showed that logistic regression had the best performance for risk assessment according to the area under curve. Notably, defective variants in MTHFR and RBP4, two genes involved in folic acid and vitamin A biosynthesis, were found to have high contributions to NSCL/P incidence based on feature importance evaluation with logistic regression. This is consistent with the notion that folic acid and vitamin A are both essential nutritional supplements for pregnant women to reduce the risk of conceiving an NSCL/P baby. Moreover, we observed a lower predictive power in Uyghur than in Han cases, likely due to differences in genetic background between these two ethnic populations. Thus, our study highlights the urgency to generate the HapMap for Uyghur population and perform resequencing-based screening of Uyghur-specific NSCL/P markers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据