期刊
BMC GENOMICS
卷 17, 期 -, 页码 -出版社
BMC
DOI: 10.1186/s12864-016-3217-x
关键词
Illumina; Sequencing; Multiplexing; Sequencing errors
资金
- Simons Foundation
- Targeted Grant in the Mathematical Modeling of Living Systems Award [342039]
- National Science Foundation [DEB 1457518]
- National Institute of Food and Agriculture, US Department of Agriculture, Hatch project [1006261]
- Division Of Environmental Biology
- Direct For Biological Sciences [1457518] Funding Source: National Science Foundation
Background: Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the Illumina sequencing platform. This results in a low rate of cross-talk among multiplexed samples and can cause detrimental effects in studies requiring the detection of rare variants or when multiplexing a large number of samples. Results: We observed rates of cross-talk averaging 0.24 % when multiplexing 14 different samples with unique i5 and i7 index sequences. This cross-talk rate corresponded to 254,632 misassigned reads on a single lane of the Illumina HiSeq 2500. Notably, all types of misassignment occur at similar rates: incorrect i5, incorrect i7, and incorrect sequence reads. We demonstrate that misassignments can be nearly eliminated by quality filtering of index reads while preserving about 90 % of the original sequences. Conclusions: Cross-talk among multiplexed samples is a significant error mode on the Illumina platform, especially if samples are only separated by a single unique index. Quality filtering of index sequences offers an effective solution to minimizing cross-talk among samples. Furthermore, we propose a straightforward method for verifying the extent of cross-talk between samples and optimizing quality score thresholds that does not require additional control samples and can even be performed post hoc on previous runs.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据