Article

wQFM: highly accurate genome-scale species tree estimation from weighted quartets

Journal

BIOINFORMATICS
Volume 37, Issue 21, Pages 3734-3743

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab428

Funding

  1. Information and Communication Technology Division (ICT Division), Government of the People's Republic of Bangladesh

Estimating species trees from genes sampled across the whole genome is challenging because of gene tree-species tree discordance, of which incomplete lineage sorting is a common cause. Weighted quartet-based methods offer a statistically consistent route to accurate species tree estimation in such cases. The proposed wQFM method extends the quartet FM algorithm to a weighted setting and yields highly accurate species trees on simulated and real biological datasets.
Motivation: Species tree estimation from genes sampled throughout the whole genome is complicated by gene tree-species tree discordance. Incomplete lineage sorting (ILS) is one of the most frequent causes of this discordance, where alleles can coexist in populations for periods that may span several speciation events. Quartet-based summary methods for estimating species trees from a collection of gene trees are becoming popular because of their high accuracy and statistical guarantees under ILS. Generating quartets with appropriate weights, where the weights correspond to the relative importance of the quartets, and then amalgamating the weighted quartets into a single coherent species tree allows for statistically consistent species tree estimation. However, handling weighted quartets is challenging.

Results: We propose wQFM, a highly accurate method for species tree estimation from multi-locus data, obtained by extending the quartet FM (QFM) algorithm to a weighted setting. wQFM was assessed on a collection of simulated and real biological datasets, including the avian phylogenomic dataset, which is one of the largest phylogenomic datasets to date. We compared wQFM with wQMC, which is the best alternative method for weighted quartet amalgamation, and with ASTRAL, which is one of the most accurate and widely used coalescent-based species tree estimation methods. Our results suggest that wQFM matches or improves upon the accuracy of wQMC and ASTRAL.

Supplementary information: Supplementary data are available at Bioinformatics online.
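
To make the notion of weighted quartets concrete, the following minimal Python sketch counts how many gene trees induce each quartet topology and uses that count as the quartet's weight. Frequency-based weighting is an assumption made here for illustration, since the abstract only says that weights reflect relative importance. The nested-tuple tree encoding and the function names (`clusters`, `induced_quartet`, `weighted_quartets`) are likewise hypothetical and are not taken from the wQFM software. The sketch covers weight generation only, not the amalgamation step that wQFM performs.

```python
from collections import Counter
from itertools import combinations


def clusters(tree):
    """Return (leaf_set, internal_clusters) for a nested-tuple tree."""
    if isinstance(tree, str):  # a leaf is just a taxon name
        return frozenset([tree]), []
    leaves, found = frozenset(), []
    for child in tree:
        child_leaves, child_clusters = clusters(child)
        leaves |= child_leaves
        found.extend(child_clusters)
    found.append(leaves)  # leaf set below this internal node
    return leaves, found


def induced_quartet(cluster_list, four):
    """Return the topology ab|cd induced on four taxa, or None if unresolved."""
    a, b, c, d = four
    splits = (((a, b), (c, d)), ((a, c), (b, d)), ((a, d), (b, c)))
    for left, right in splits:
        for cl in cluster_list:
            left_in = [t in cl for t in left]
            right_in = [t in cl for t in right]
            # An edge separates the two pairs if its cluster holds one pair
            # entirely and excludes the other pair entirely.
            if (all(left_in) and not any(right_in)) or (
                all(right_in) and not any(left_in)
            ):
                return tuple(sorted((tuple(sorted(left)), tuple(sorted(right)))))
    return None


def weighted_quartets(gene_trees):
    """Map each quartet topology to the number of gene trees inducing it."""
    weights = Counter()
    for tree in gene_trees:
        leaves, cluster_list = clusters(tree)
        for four in combinations(sorted(leaves), 4):
            quartet = induced_quartet(cluster_list, four)
            if quartet is not None:
                weights[quartet] += 1
    return weights


# Three toy gene trees on taxa A-D; two support AB|CD and one supports AC|BD.
gene_trees = [
    (("A", "B"), ("C", "D")),
    ((("A", "B"), "C"), "D"),
    (("A", "C"), ("B", "D")),
]
weights = weighted_quartets(gene_trees)
for (left, right), w in sorted(weights.items()):
    print("".join(left) + "|" + "".join(right), w)
```

This brute-force enumeration visits every four-taxon subset of every gene tree, so it is only practical for small inputs. It is meant to show what a weighted quartet set looks like before amalgamation, not how to compute one at genome scale.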
