4.7 Article

Phenome risk classification enables phenotypic imputation and gene discovery in developmental stuttering

期刊

AMERICAN JOURNAL OF HUMAN GENETICS
卷 108, 期 12, 页码 2271-2283

出版社

CELL PRESS
DOI: 10.1016/j.ajhg.2021.11.004

关键词

-

资金

  1. National Institute on Deafness and Other Communication Disorders (NIDCD), National Institutes of Health [R03DC015329, R01DC017175, R21DC016723]
  2. NIDCD [5T32GM80178-13]
  3. NIH
  4. Clinical and Trans-lational Science Awards [UL1TR002243, UL1TR000445, UL1RR024975]
  5. [U01HG004798]
  6. [R01NS032830]
  7. [RC2GM092618]
  8. [P50GM115305]
  9. [U01HG006378]
  10. [U19HL065962]
  11. [R01HD074711]

向作者/读者索取更多资源

Developmental stuttering is a speech disorder with a prevalence of 6-12%, but only 0.15% of individuals are diagnosed within electronic health records. A model called PheML was used to identify 9,239 affected individuals for genetic analysis, revealing genetic variants associated with stuttering in ancestry-stratified analysis.
Developmental stuttering is a speech disorder characterized by disruption in the forward movement of speech. This disruption includes part-word and single-syllable repetitions, prolongations, and involuntary tension that blocks syllables and words, and the disorder has a life-time prevalence of 6-12%. Within Vanderbilt's electronic health record (EHR)-linked biorepository (BioVU), only 142 individuals out of 92,762 participants (0.15%) are identified with diagnostic ICD9/10 codes, suggesting a large portion of people who stutter do not have a record of diagnosis within the EHR. To identify individuals affected by stuttering within our EHR, we built a PheCode-driven Gini impurity-based classification and regression tree model, PheML, by using comorbidities enriched in individuals affected by stuttering as predicting features and imputing stuttering status as the outcome variable. Applying PheML in BioVU identified 9,239 genotyped affected individuals (a clinical prevalence of similar to 10%) for downstream genetic analysis. Ancestry-stratified GWAS of PheML-imputed affected individuals and matched control individuals identified rs12613255, a variant near CYRIA on chromosome 2 (B = 0.323; p value =1.31 x 10(-8)) in European-ancestry analysis and rs7837758 (B = 0.518; p value = 5.07 x 10(-8)), an intronic variant found within the ZMAT4 gene on chromosome 8, in African-ancestry analysis. Polygenic-risk prediction and concordance analysis in an independent clinically ascertained sample of developmental stuttering cases validate our GWAS findings in PheML-imputed affected and control individuals and demonstrate the clinical relevance of our population-based analysis for stuttering risk.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据