期刊
NUCLEIC ACIDS RESEARCH
卷 34, 期 1, 页码 201-205出版社
OXFORD UNIV PRESS
DOI: 10.1093/nar/gkj419
关键词
-
资金
- NHGRI NIH HHS [U54 HG003079] Funding Source: Medline
- NATIONAL HUMAN GENOME RESEARCH INSTITUTE [U54HG003079] Funding Source: NIH RePORTER
We introduce a data structure called a superword array for finding quickly matches between DNA sequences. The superword array possesses some desirable features of the lookup table and suffix array. We describe simple algorithms for constructing and using a superword array to find pairs of sequences that share a unique superword. The algorithms are implemented in a genome assembly program called PCAP.REP for computation of overlaps between reads. Experimental results produced by PCAP. REP and PCAP on a whole-genome dataset show that PCAP.REP produced a more accurate and contiguous assembly than PCAP.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据