4.6 Article

A framework for automated gene selection in genomic applications

期刊

GENETICS IN MEDICINE
卷 23, 期 10, 页码 1993-1997

出版社

ELSEVIER SCIENCE INC
DOI: 10.1038/s41436-021-01213-x

关键词

-

资金

  1. National Institutes of Health/National Heart, Lung, and Blood Institute [5R01HL143295]

向作者/读者索取更多资源

The study proposes an efficient framework for gene selection, categorizing genes into those with strong or established evidence and those with limited or emerging evidence of disease association. By extracting and filtering genes from multiple databases, a highly sensitive gene list is created, while maintaining dynamism and updatability.
Purpose An efficient framework to identify disease-associated genes is needed to evaluate genomic data for both individuals with an unknown disease etiology and those undergoing genomic screening. Here, we propose a framework for gene selection used in genomic analyses, including applications limited to genes with strong or established evidence levels and applications including genes with less or emerging evidence of disease association. Methods We extracted genes with evidence for gene-disease association from the Human Gene Mutation Database, OMIM, and ClinVar to build a comprehensive gene list of 6,145 genes. Next, we applied stringent filters in conjunction with computationally curated evidence (DisGeNET) to create a restrictive list limited to 3,929 genes with stronger disease associations. Results When compared to manual gene curation efforts, including the Clinical Genome Resource, genes with strong or definitive disease associations are included in both gene lists at high percentages, while genes with limited evidence are largely removed. We further confirmed the utility of this approach in identifying pathogenic and likely pathogenic variants in 45 genomes. Conclusion Our approach efficiently creates highly sensitive gene lists for genomic applications, while remaining dynamic and updatable, enabling time savings in genomic applications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据