4.5 Article

Space efficient computation of rare maximal exact matches between multiple sequences

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY
Volume 15, Issue 4, Pages 357-377

Publisher

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2007.0105

Keywords

alignment; algorithms; strings; suffix trees.

Ask authors/readers for more resources

In this article, we propose a new method for computing rare maximal exact matches between multiple sequences. A rare match between k sequences S-1,...,S-k is a string that occurs at most t(i)-times in the sequence Si, where the t(i) > 0 are user-defined thresholds. First, the suffix tree of one of the sequences ( the reference sequence) is built, and then the other sequences are matched separately against this suffix tree. Second, the resulting pairwise exact matches are combined to multiple exact matches. A clever implementation of this method yields a very fast and space efficient program. This program can be applied in several comparative genomics tasks, such as the identification of synteny blocks between whole genomes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available