4.6 Article

The Influence of Gene Flow on Species Tree Estimation: A Simulation Study

期刊

SYSTEMATIC BIOLOGY
卷 63, 期 1, 页码 17-30

出版社

OXFORD UNIV PRESS
DOI: 10.1093/sysbio/syt049

关键词

*BEAST; BEST; coalescence; compression; dilation; introgression; MPEST; migration; simulation

资金

  1. National Science Foundation [DBI-0805455, DBI-1144630]
  2. Royalty Research Fund (UW) [A61649]
  3. Biotechnological and Biological Sciences Research Council grant (UK)
  4. Royal Society/Wolfson Research Merit Award
  5. BBSRC [BB/K000896/1] Funding Source: UKRI
  6. Biotechnology and Biological Sciences Research Council [BB/K000896/1] Funding Source: researchfish

向作者/读者索取更多资源

Gene flow among populations or species and incomplete lineage sorting (ILS) are two evolutionary processes responsible for generating gene tree discordance and therefore hindering species tree estimation. Numerous studies have evaluated the impacts of ILS on species tree inference, yet the ramifications of gene flow on species trees remain less studied. Here, we simulate and analyse multilocus sequence data generated with ILS and gene flow to quantify their impacts on species tree inference. We characterize species tree estimation errors under various models of gene flow, such as the isolation-migration model, the n-island model, and gene flow between non-sister species or involving ancestral species, and species boundaries crossed by a single gene copy (allelic introgression) or by a single migrant individual. These patterns of gene flow are explored on species trees of different sizes (4 vs. 10 species), at different time scales (shallow vs. deep), and with different migration rates. Species trees are estimated with the multispecies coalescent model using Bayesian methods (BEST and *BEAST) and with a summary statistic approach (MPEST) that facilitates phylogenomic-scale analysis. Even in cases where the topology of the species tree is estimated with high accuracy, we find that gene flow can result in overestimates of population sizes (species tree dilation) and underestimates of species divergence times (species tree compression). Signatures of migration events remain present in the distribution of coalescent times for gene trees, and with sufficient data it is possible to identify those loci that have crossed species boundaries. These results highlight the need for careful sampling design in phylogeographic and species delimitation studies as gene flow, introgression, or incorrect sample assignments can bias the estimation of the species tree topology and of parameter estimates such as population sizes and divergence times.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据