期刊
MOLECULAR PHYLOGENETICS AND EVOLUTION
卷 134, 期 -, 页码 164-171出版社
ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ympev.2019.02.012
关键词
Phylogenetic inference; Phylogenomics; Species trees; Concatenation; Coalescent; Supergene
资金
- NSF [DEB-1655571]
- University of Texas at Arlington Phi Sigma Society grant
Fundamental to all phylogenomic studies is the notion that increasing the amount of data - to entire genomes when possible - will increase the accuracy of phylogenetic inference. Simply adding more data does not, however, guarantee phylogenomic inferences will be more accurate. Even genome-scale reconstructions of species histories can suffer the effects of both incomplete lineage sorting (ILS) and gene tree estimation error (GTEE). Weighted statistical binning was originally proposed as a technique to assist the avian phylogenomics project in solving the bird tree of life, which has long eluded resolution as a result of both ILS and GTEE. These so-called statistical binning procedures seek to overcome GTEE by concatenating loci into longer multi-locus supergenes that are used to reconstruct a species tree under the assumption that the supergene tree set is an accurate estimate of the true underlying gene tree distribution. Here we evaluate the performance of the method using the original avian phylogenomics dataset. Our results suggest that statistical binning constructs false supergenes that concatenate loci with different coalescent histories more often than not > 92% of supergenes comprise discordant loci. Our results underscore a major logical inconsistency: GTEE - the sole justification for using statistical binning instead of standard concatenation - also makes these methods unreliable. These findings underscore the need for developing new robust frameworks for phylogenomic inference that more appropriately accommodate GTEE and ILS at a genome-wide scale.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据