4.6 Article

PaSS: a sequencing simulator for PacBio sequencing

期刊

BMC BIOINFORMATICS
卷 20, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12859-019-2901-7

关键词

Third generation sequencing; Next generation sequencing; PacBio sequencing; Sequencing simulator; Sequencing error; Sequence pattern

资金

  1. National Natural Science Foundation of China [61472246]
  2. National Basic Research Program of China [2013CB956103]
  3. National High-Tech RD Program (863) [2014AA021502]
  4. Cross-Institute Research Fund of Shanghai Jiao Tong University [YG2017ZD01]

向作者/读者索取更多资源

BackgroundThird-generation sequencing platforms, such as PacBio sequencing, have been developed rapidly in recent years. PacBio sequencing generates much longer reads than the second-generation sequencing (or the next generation sequencing, NGS) technologies and it has unique sequencing error patterns. An effective read simulator is essential to evaluate and promote the development of new bioinformatics tools for PacBio sequencing data analysis.ResultsWe developed a new PacBio Sequencing Simulator (PaSS). It can learn sequence patterns from PacBio sequencing data currently available. In addition to the distribution of read lengths and error rates, we included a context-specific sequencing error model. Compared to existing PacBio sequencing simulators such as PBSIM, LongISLND and NPBSS, PaSS performed better in many aspects. Assembly tests also suggest that reads simulated by PaSS are the most similar to experimental sequencing data.ConclusionPaSS is an effective sequence simulator for PacBio sequencing. It will facilitate the evaluation and development of new analysis tools for the third-generation sequencing data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据