4.7 Article

Polishing copy number variant calls on exome sequencing data via deep learning

期刊

GENOME RESEARCH
卷 32, 期 6, 页码 1170-1182

出版社

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.274845.120

关键词

-

资金

  1. Turkiye Bilimler Akademisi ustun Baarl Genc Bilim nsan odulleri Program (TUBA GEBIP), Bilim Akademisi-Genc Bilim nsanlar odul Program
  2. TUSEB Aziz Sancar Research Incentive awards

向作者/读者索取更多资源

Accurate and efficient detection of copy number variants (CNVs) is crucial for studying complex genetic diseases. However, copy number detection on whole-exome sequencing (WES) data is less accurate compared to whole-genome sequencing (WGS) data. This study introduces a novel deep learning model, DECoNT, which improves the precision of CNV detection on WES data sets, regardless of sequencing technology, exome capture kit, and CNV caller.
Accurate and efficient detection of copy number variants (CNVs) is of critical importance owing to their significant association with complex genetic diseases. Although algorithms that use whole-genome sequencing (WGS) data provide stable results with mostly valid statistical assumptions, copy number detection on whole-exome sequencing (WES) data shows comparatively lower accuracy. This is unfortunate as WES data are cost-efficient, compact, and relatively ubiquitous. The bottleneck is primarily due to the noncontiguous nature of the targeted capture: biases in targeted genomic hybridization, GC content, targeting probes, and sample batching during sequencing. Here, we present a novel deep learning model, DECoNT, which uses the matched WES and WGS data, and learns to correct the copy number variations reported by any off-the-shelf WES-based germline CNV caller. We train DECoNT on the 1000 Genomes Project data, and we show that we can efficiently triple the duplication call precision and double the deletion call precision of the state-of-the-art algorithms. We also show that our model consistently improves the performance independent of (1) sequencing technology, (2) exome capture kit, and (3) CNV caller. Using DECoNT as a universal exome CNV call polisher has the potential to improve the reliability of germline CNV detection on WES data sets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据