4.0 Article

Sequencing error profiles of Illumina sequencing instruments

期刊

NAR GENOMICS AND BIOINFORMATICS
卷 3, 期 1, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nargab/lqab019

关键词

-

资金

  1. NHGRI [U41 HG006620]
  2. NSF ABI Grant [1661497]
  3. NIAID [R01 AI134384]
  4. Direct For Biological Sciences
  5. Div Of Biological Infrastructure [1661497] Funding Source: National Science Foundation

向作者/读者索取更多资源

This study developed a method to retrospectively determine the error rate of public sequencing datasets, finding that expensive platforms have lower error rates and less variation, but there is significant variation within each platform, with experiment accuracy depending greatly on the experimenter. The importance of sequence context and differences in sequence bias patterns between instruments were also highlighted.
Sequencing technology has achieved great advances in the past decade. Studies have previously shown the quality of specific instruments in controlled conditions. Here, we developed a method able to retroactively determine the error rate of most public sequencing datasets. To do this, we utilized the overlaps between reads that are a feature of many sequencing libraries. With this method, we surveyed 1943 different datasets from seven different sequencing instruments produced by Illumina. We show that among public datasets, the more expensive platforms like HiSeq and NovaSeq have a lower error rate and less variation. But we also discovered that there is great variation within each platform, with the accuracy of a sequencing experiment depending greatly on the experimenter. We show the importance of sequence context, especially the phenomenon where preceding bases bias the following bases toward the same identity. We also show the difference in patterns of sequence bias between instruments. Contrary to expectations based on the underlying chemistry, HiSeq X Ten and NovaSeq 6000 share notable exceptions to the preceding-base bias. Our results demonstrate the importance of the specific circumstances of every sequencing experiment, and the importance of evaluating the quality of each one.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据