4.7 Article Proceedings Paper

Database indexing for production MegaBLAST searches

期刊

BIOINFORMATICS
卷 24, 期 16, 页码 1757-1764

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btn322

关键词

-

资金

  1. Intramural NIH HHS Funding Source: Medline

向作者/读者索取更多资源

Motivation: The BLAST software package for sequence comparison speeds up homology search by preprocessing a query sequence into a lookup table. Numerous research studies have suggested that preprocessing the database instead would give better performance. However, production usage of sequence comparison methods that preprocess the database has been limited to programs such as BLAT and SSAHA that are designed to find matches when query and database subsequences are highly similar. Results: We developed a new version of the MegaBLAST module of BLAST that does the initial phase of finding short seeds for matches by searching a database index. We also developed a program makembindexthat preprocesses the database into a data structure for rapid seed searching. We show that the new indexed MegaBLAST is faster than the non-indexed version for most practical uses. We show that indexed MegaBLAST is faster than miBLAST, another implementation of BLAST nucleotide searching with a preprocessed database, for most of the 200 queries we tested. To deploy indexed MegaBLAST as part of NCBIsWeb BLAST service, the storage of databases and the queueing mechanism were modified, so that some machines are now dedicated to serving queries for a specific database. The response time for such Web queries is now faster than it was when each computer handled queries for multiple databases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据