4.6 Article

MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments

Journal

BMC BIOINFORMATICS
Volume 13, Issue -, Pages -

Publisher

BIOMED CENTRAL LTD
DOI: 10.1186/1471-2105-13-117

Keywords

-

Funding

  1. BBSRC [BB/D020190/1]
  2. Biotechnology and Biological Sciences Research Council [BB/D020190/1] Funding Source: researchfish
  3. Natural Environment Research Council [MBA010001] Funding Source: researchfish
  4. BBSRC [BB/D020190/1] Funding Source: UKRI
  5. NERC [MBA010001] Funding Source: UKRI

Ask authors/readers for more resources

Background: The generation of multiple sequence alignments (MSAs) is a crucial step for many bioinformatic analyses. Thus improving MSA accuracy and identifying potential errors in MSAs is important for a wide range of post-genomic research. We present a novel method called MergeAlign which constructs consensus MSAs from multiple independent MSAs and assigns an alignment precision score to each column. Results: Using conventional benchmark tests we demonstrate that on average MergeAlign MSAs are more accurate than MSAs generated using any single matrix of sequence substitution. We show that MergeAlign column scores are related to alignment precision and hence provide an ab initio method of estimating alignment precision in the absence of curated reference MSAs. Using two novel and independent alignment performance tests that utilise a large set of orthologous gene families we demonstrate that increasing MSA performance leads to an increase in the performance of downstream phylogenetic analyses. Conclusion: Using multiple tests of alignment performance we demonstrate that this novel method has broad general application in biological research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available