期刊
JOURNAL OF COMPUTATIONAL BIOLOGY
卷 15, 期 4, 页码 357-377出版社
MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2007.0105
关键词
alignment; algorithms; strings; suffix trees.
In this article, we propose a new method for computing rare maximal exact matches between multiple sequences. A rare match between k sequences S-1,...,S-k is a string that occurs at most t(i)-times in the sequence Si, where the t(i) > 0 are user-defined thresholds. First, the suffix tree of one of the sequences ( the reference sequence) is built, and then the other sequences are matched separately against this suffix tree. Second, the resulting pairwise exact matches are combined to multiple exact matches. A clever implementation of this method yields a very fast and space efficient program. This program can be applied in several comparative genomics tasks, such as the identification of synteny blocks between whole genomes.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据