☆ 4.6 Article Proceedings Paper

Exploiting sparseness in de novo genome assembly

BMC BIOINFORMATICS (2012)

Journal

BMC BIOINFORMATICS

Volume 13, Issue -, Pages -

Publisher

BMC

DOI: 10.1186/1471-2105-13-S6-S1

Keywords

Funding

Direct For Computer & Info Scie & Enginr
Div Of Information & Intelligent Systems [0812111] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Background: The very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. Methods: In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the observed k-mers as nodes and the links between these nodes allows the de novo assembly of even moderately-sized genomes (similar to 500 M) on a typical laptop computer. Results: We implement this sparse graph concept in a proof-of-principle software package, SparseAssembler, utilizing a new sparse k-mer graph structure evolved from the de Bruijn graph. We test our SparseAssembler with both simulated and real data, achieving similar to 90% memory savings and retaining high assembly accuracy, without sacrificing speed in comparison to existing de novo assemblers.

Exploiting sparseness in de novo genome assembly

Journal

BMC BIOINFORMATICS

Publisher

BMC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Exploiting sparseness in de novo genome assembly

Journal

BMC BIOINFORMATICS

Publisher

BMC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper