Application of a superword array in genome assembly

NUCLEIC ACIDS RESEARCH (2006)

Journal

NUCLEIC ACIDS RESEARCH

Volume 34, Issue 1, Pages 201-205

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/nar/gkj419

Funding

NHGRI NIH HHS [U54 HG003079] Funding Source: Medline
NATIONAL HUMAN GENOME RESEARCH INSTITUTE [U54HG003079] Funding Source: NIH RePORTER

Abstract

We introduce a data structure called a superword array for quickly finding matches between DNA sequences. The superword array possesses some desirable features of the lookup table and the suffix array. We describe simple algorithms for constructing a superword array and using it to find pairs of sequences that share a unique superword. The algorithms are implemented in a genome assembly program called PCAP.REP to compute overlaps between reads. Experimental results produced by PCAP.REP and PCAP on a whole-genome dataset show that PCAP.REP produced a more accurate and contiguous assembly than PCAP.
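
The abstract does not spell out the construction, so the following is only a rough sketch of the general idea, not the PCAP.REP implementation. It assumes a superword is the concatenation of a few consecutive fixed-length words, that the array is a list of (read, offset) positions sorted by the superword starting there, and that "unique" means the superword occurs at no more than `max_occ` positions. The function names and the parameters `word_len`, `words_per_superword`, and `max_occ` are hypothetical, and reverse-complement matches are ignored.

```python
from itertools import combinations, groupby


def build_superword_array(reads, word_len=3, words_per_superword=2):
    """Return (entries, span): positions sorted by the superword they start.

    Assumption: a superword is `words_per_superword` consecutive words of
    `word_len` bases each, i.e. one substring of length `span`.
    """
    span = word_len * words_per_superword
    entries = [
        (read_id, offset)
        for read_id, seq in enumerate(reads)
        for offset in range(len(seq) - span + 1)
    ]
    # Sorting positions by their superword places equal superwords side by
    # side, much as a suffix array does for suffixes.
    entries.sort(key=lambda e: reads[e[0]][e[1]:e[1] + span])
    return entries, span


def pairs_sharing_rare_superword(reads, word_len=3, words_per_superword=2,
                                 max_occ=2):
    """Return sorted pairs of read ids that share a low-frequency superword."""
    entries, span = build_superword_array(reads, word_len, words_per_superword)

    def superword(entry):
        read_id, offset = entry
        return reads[read_id][offset:offset + span]

    pairs = set()
    for _, run in groupby(entries, key=superword):
        run = list(run)
        if len(run) > max_occ:
            continue  # assumed rule: frequent superwords are treated as repeats
        read_ids = sorted({read_id for read_id, _ in run})
        pairs.update(combinations(read_ids, 2))
    return sorted(pairs)


reads = ["ACGTACGGTT", "TTACGGTTCA", "GGGGGGGGGG"]
print(pairs_sharing_rare_superword(reads))
```

In this toy input the first two reads share the superwords `TACGGT` and `ACGGTT`, each occurring twice, so they are reported as a candidate overlap pair. The superword `GGGGGG` occurs five times in the third read and is skipped under the assumed frequency cutoff.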
