4.5 Article

PASTA with many application-aware optimization criteria for alignment based phylogeny inference

Journal

COMPUTATIONAL BIOLOGY AND CHEMISTRY
Volume 98, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2022.107661

Keywords

Multiple sequence alignment; Phylogenetic tree; Multi-objective optimization

Funding

  1. ICT Doctoral Fellowship

Ask authors/readers for more resources

This paper introduces the application of the PASTA method in multiple sequence alignment and proposes a multi-objective framework called PMAO to improve the performance of PASTA by integrating multiple objectives related to the accuracy of the phylogenetic tree. Experimental results show that the tree-space generated by PMAO is better than using PASTA alone, and adding an additional component can generate smaller and higher quality solutions.
Multiple sequence alignment (MSA) is a prerequisite for several analyses in bioinformatics, such as, phylogeny estimation, protein structure prediction, etc. PASTA (Practical Alignments using SATe ' and TrAnsitivity) is a stateof-the-art method for computing MSAs, well-known for its accuracy and scalability. It iteratively co-estimates both MSA and maximum likelihood (ML) phylogenetic tree. It attempts to exploit the close association between the accuracy of an MSA and the corresponding tree while finding the output through multiple iterations from both directions. Currently, PASTA uses the ML score as its optimization criterion which is a good score in phylogeny estimation but cannot be proven as a necessary and sufficient criterion to produce an accurate phylogenetic tree. Therefore, the integration of multiple application-aware objectives into PASTA, which are carefully chosen considering their better association to the tree accuracy, may potentially have a profound positive impact on its performance. This paper has employed four application-aware objectives alongside ML score to develop a multi-objective (MO) framework, namely, PMAO that leverages PASTA to generate a bunch of high-quality solutions that are considered equivalent in the context of conflicting objectives under consideration. our experimental analysis on a popular biological benchmark reveals that the tree-space generated by PMAO contains significantly better trees than stand-alone PASTA. To help the domain experts further in choosing the most appropriate tree from the PMAO output (containing a relatively large set of high-quality solutions), we have added an additional component within the PMAO framework that is capable of generating a smaller set of highquality solutions. Finally, we have attempted to obtain a single high-quality solution without using any external evidences and have found that summarizing the few solutions detected through the above component can serve this purpose to some extent.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available