4.8 Article

Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing

期刊

NATURE BIOTECHNOLOGY
卷 30, 期 1, 页码 61-U103

出版社

NATURE PUBLISHING GROUP
DOI: 10.1038/nbt.2053

关键词

-

资金

  1. Fund for Scientific Research Flanders (FWO-F)
  2. Agency for Innovation by Science and Technology (IWT)
  3. Stichting tegen Kanker, FWO-F
  4. KULeuven [KULPFV/10/016-SymBioSysII]

向作者/读者索取更多资源

Distinguishing single-nucleotide variants (SNVs) from errors in whole-genome sequences remains challenging. Here we describe a set of filters, together with a freely accessible software tool, that selectively reduce error rates and thereby facilitate variant detection in data from two short-read sequencing technologies, Complete Genomics and Illumina. By sequencing the nearly identical genomes from monozygotic twins and considering shared SNVs as 'true variants' and discordant SNVs as 'errors', we optimized thresholds for 12 individual filters and assessed which of the 1,048 filter combinations were effective in terms of sensitivity and specificity. Cumulative application of all effective filters reduced the error rate by 290-fold, facilitating the identification of genetic differences between monozygotic twins. We also applied an adapted, less stringent set of filters to reliably identify somatic mutations in a highly rearranged tumor and to identify variants in the NA19240 HapMap genome relative to a reference set of SNVs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据