4.5 Article

PASTA with many application-aware optimization criteria for alignment based phylogeny inference

期刊

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2022.107661

关键词

Multiple sequence alignment; Phylogenetic tree; Multi-objective optimization

资金

  1. ICT Doctoral Fellowship

向作者/读者索取更多资源

This paper introduces the application of the PASTA method in multiple sequence alignment and proposes a multi-objective framework called PMAO to improve the performance of PASTA by integrating multiple objectives related to the accuracy of the phylogenetic tree. Experimental results show that the tree-space generated by PMAO is better than using PASTA alone, and adding an additional component can generate smaller and higher quality solutions.
Multiple sequence alignment (MSA) is a prerequisite for several analyses in bioinformatics, such as, phylogeny estimation, protein structure prediction, etc. PASTA (Practical Alignments using SATe ' and TrAnsitivity) is a stateof-the-art method for computing MSAs, well-known for its accuracy and scalability. It iteratively co-estimates both MSA and maximum likelihood (ML) phylogenetic tree. It attempts to exploit the close association between the accuracy of an MSA and the corresponding tree while finding the output through multiple iterations from both directions. Currently, PASTA uses the ML score as its optimization criterion which is a good score in phylogeny estimation but cannot be proven as a necessary and sufficient criterion to produce an accurate phylogenetic tree. Therefore, the integration of multiple application-aware objectives into PASTA, which are carefully chosen considering their better association to the tree accuracy, may potentially have a profound positive impact on its performance. This paper has employed four application-aware objectives alongside ML score to develop a multi-objective (MO) framework, namely, PMAO that leverages PASTA to generate a bunch of high-quality solutions that are considered equivalent in the context of conflicting objectives under consideration. our experimental analysis on a popular biological benchmark reveals that the tree-space generated by PMAO contains significantly better trees than stand-alone PASTA. To help the domain experts further in choosing the most appropriate tree from the PMAO output (containing a relatively large set of high-quality solutions), we have added an additional component within the PMAO framework that is capable of generating a smaller set of highquality solutions. Finally, we have attempted to obtain a single high-quality solution without using any external evidences and have found that summarizing the few solutions detected through the above component can serve this purpose to some extent.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据