4.6 Article Proceedings Paper

Exploiting sparseness in de novo genome assembly

Journal

BMC BIOINFORMATICS
Volume 13, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/1471-2105-13-S6-S1

Keywords

-

Funding

  1. Direct For Computer & Info Scie & Enginr
  2. Div Of Information & Intelligent Systems [0812111] Funding Source: National Science Foundation

Ask authors/readers for more resources

Background: The very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. Methods: In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the observed k-mers as nodes and the links between these nodes allows the de novo assembly of even moderately-sized genomes (similar to 500 M) on a typical laptop computer. Results: We implement this sparse graph concept in a proof-of-principle software package, SparseAssembler, utilizing a new sparse k-mer graph structure evolved from the de Bruijn graph. We test our SparseAssembler with both simulated and real data, achieving similar to 90% memory savings and retaining high assembly accuracy, without sacrificing speed in comparison to existing de novo assemblers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available