4.5 Article

Characterization of background noise in capture-based targeted sequencing data

期刊

GENOME BIOLOGY
卷 18, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s13059-017-1275-2

关键词

Next-generation sequencing; Targeted deep sequencing; Substitution rate; Background error; DNA fragmentation; Plasma DNA

资金

  1. Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) - Ministry of Health & Welfare, Republic of Korea [HI13C2096]
  2. Ministry of Food & Drug Safety, Republic of Korea [16173MFDS004]

向作者/读者索取更多资源

Background: Targeted deep sequencing is increasingly used to detect low-allelic fraction variants; it is therefore essential that errors that constitute baseline noise and impose a practical limit on detection are characterized. In the present study, we systematically evaluate the extent to which errors are incurred during specific steps of the capture-based targeted sequencing process. Results: We removed most sequencing artifacts by filtering out low-quality bases and then analyze the remaining background noise. By recognizing that plasma DNA is naturally fragmented to be of a size comparable to that of mono-nucleosomal DNA, we were able to identify and characterize errors that are specifically associated with acoustic shearing. Two-thirds of C: G > A: T errors and one quarter of C: G > G: C errors were attributed to the oxidation of guanine during acoustic shearing, and this was further validated by comparative experiments conducted under different shearing conditions. The acoustic shearing step also causes A > G and A > T substitutions localized to the end bases of sheared DNA fragments, indicating a probable association of these errors with DNA breakage. Finally, the hybrid selection step contributes to one-third of the remaining C: G > A: T and one-fifth of the C > T errors. Conclusions: The results of this study provide a comprehensive summary of various errors incurred during targeted deep sequencing, and their underlying causes. This information will be invaluable to drive technical improvements in this sequencing method, and may increase the future usage of targeted deep sequencing methods for low-allelic fraction variant detection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据