4.7 Article

DeepCNV: a deep learning approach for authenticating copy number variations

期刊

BRIEFINGS IN BIOINFORMATICS
卷 22, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbaa381

关键词

copy number variation; deep learning

资金

  1. Children's Hospital of Philadelphia Endowed Chair in Genomic Research (NHGRI) [U01-HG006830]
  2. Children's Hospital of Philadelphia (Institutional Development Fund to the Center for Applied Genomics)
  3. Extreme Science and Engineering Discovery Environment (XSEDE) [National Science Foundation] [CIE170034, ACI-1548562]

向作者/读者索取更多资源

Copy number variations (CNVs) are important in disease pathogenesis, but detection and validation remain challenging. DeepCNV, a deep learning-based tool, improves CNV call accuracy and reduces false positives and failures in CNV-disease association results.
Copy number variations (CNVs) are an important class of variations contributing to the pathogenesis of many disease phenotypes. Detecting CNVs from genomic data remains difficult, and the most currently applied methods suffer from an unacceptably high false positive rate. A common practice is to have human experts manually review original CNV calls for filtering false positives before further downstream analysis or experimental validation. Here, we propose DeepCNV, a deep learning-based tool, intended to replace human experts when validating CNV calls, focusing on the calls made by one of the most accurate CNV callers, PennCNV. The sophistication of the deep neural network algorithm is enriched with over 10 000 expert-scored samples that are split into training and testing sets. Variant confidence, especially for CNVs, is a main roadblock impeding the progress of linking CNVs with the disease. We show that DeepCNV adds to the confidence of the CNV calls with an optimal area under the receiver operating characteristic curve of 0.909, exceeding other machine learning methods. The superiority of DeepCNV was also benchmarked and confirmed using an experimental wet-lab validation dataset. We conclude that the improvement obtained by DeepCNV results in significantly fewer false positive results and failures to replicate the CNV association results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据