4.7 Article

Statistical binning leads to profound model violation due to gene tree error incurred by trying to avoid gene tree error

期刊

MOLECULAR PHYLOGENETICS AND EVOLUTION
卷 134, 期 -, 页码 164-171

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ympev.2019.02.012

关键词

Phylogenetic inference; Phylogenomics; Species trees; Concatenation; Coalescent; Supergene

资金

  1. NSF [DEB-1655571]
  2. University of Texas at Arlington Phi Sigma Society grant

向作者/读者索取更多资源

Fundamental to all phylogenomic studies is the notion that increasing the amount of data - to entire genomes when possible - will increase the accuracy of phylogenetic inference. Simply adding more data does not, however, guarantee phylogenomic inferences will be more accurate. Even genome-scale reconstructions of species histories can suffer the effects of both incomplete lineage sorting (ILS) and gene tree estimation error (GTEE). Weighted statistical binning was originally proposed as a technique to assist the avian phylogenomics project in solving the bird tree of life, which has long eluded resolution as a result of both ILS and GTEE. These so-called statistical binning procedures seek to overcome GTEE by concatenating loci into longer multi-locus supergenes that are used to reconstruct a species tree under the assumption that the supergene tree set is an accurate estimate of the true underlying gene tree distribution. Here we evaluate the performance of the method using the original avian phylogenomics dataset. Our results suggest that statistical binning constructs false supergenes that concatenate loci with different coalescent histories more often than not > 92% of supergenes comprise discordant loci. Our results underscore a major logical inconsistency: GTEE - the sole justification for using statistical binning instead of standard concatenation - also makes these methods unreliable. These findings underscore the need for developing new robust frameworks for phylogenomic inference that more appropriately accommodate GTEE and ILS at a genome-wide scale.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据