Article

De novo sequencing and variant calling with nanopores using PoreSeq

Journal

NATURE BIOTECHNOLOGY
Volume 33, Issue 10, Pages 1087-+

Publisher

NATURE PUBLISHING GROUP
DOI: 10.1038/nbt.3360

Funding

  1. National Institutes of Health [R01HG003703]

The accuracy of sequencing single DNA molecules with nanopores is continually improving, but de novo genome sequencing and assembly using only nanopore data remain challenging. Here we describe PoreSeq, an algorithm that identifies and corrects errors in nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA transits through the nanopore and finds the sequence that best explains multiple reads of the same region. PoreSeq increases nanopore sequencing read accuracy of M13 bacteriophage DNA from 85% to 99% at 100x coverage. We also use the algorithm to assemble Escherichia coli with 30x coverage and the lambda genome at a range of coverages from 3x to 50x. Additionally, we classify sequence variants at an order of magnitude lower coverage than is possible with existing methods.
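The idea of finding "the sequence that best explains multiple reads of the same region" can be illustrated with a toy sketch. This is not PoreSeq's implementation: the abstract describes modeling uncertainty in the nanopore signal itself, whereas the sketch below works on already base-called strings and uses summed edit distance as a crude stand-in for a likelihood. The function names (`edit_distance`, `total_cost`, `neighbors`, `greedy_consensus`) are hypothetical and chosen only for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]


def total_cost(candidate: str, reads: list[str]) -> int:
    """Assumed proxy for 'how badly the candidate explains the reads'."""
    return sum(edit_distance(candidate, r) for r in reads)


def neighbors(seq: str, alphabet: str = "ACGT"):
    """Yield every sequence one deletion, substitution, or insertion away."""
    for i in range(len(seq)):
        yield seq[:i] + seq[i + 1:]                    # deletion
        for c in alphabet:
            if c != seq[i]:
                yield seq[:i] + c + seq[i + 1:]        # substitution
    for i in range(len(seq) + 1):
        for c in alphabet:
            yield seq[:i] + c + seq[i:]                # insertion


def greedy_consensus(reads: list[str], max_rounds: int = 50) -> str:
    """Start from the best single read and accept single edits that lower the cost."""
    best = min(reads, key=lambda r: total_cost(r, reads))
    best_cost = total_cost(best, reads)
    for _ in range(max_rounds):
        improved = False
        for cand in neighbors(best):
            cost = total_cost(cand, reads)
            if cost < best_cost:
                best, best_cost, improved = cand, cost, True
                break                                  # restart from the new candidate
        if not improved:
            break                                      # local optimum reached
    return best


# Two clean reads outvote one read carrying a substitution error.
print(greedy_consensus(["ACGT", "ACGT", "AGGT"]))  # ACGT
```

In this simplified setting, each additional read adds another term to the cost, so an isolated error in one read is increasingly outvoted. That loosely mirrors the abstract's statement that accuracy improves with coverage depth, though the sketch makes no claim about the actual figures reported.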
