4.4 Article

The large-sample asymptotic behaviour of quartet-based summary methods for species tree inference

期刊

JOURNAL OF MATHEMATICAL BIOLOGY
卷 85, 期 3, 页码 -

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s00285-022-01786-4

关键词

Species tree; Asymptotic behaviour; Sample complexity; Multispecies coalescent

资金

  1. Agence Nationale de la Recherche [ANR-19-CE45-0012]
  2. Agence Nationale de la Recherche (ANR) [ANR-19-CE45-0012] Funding Source: Agence Nationale de la Recherche (ANR)

向作者/读者索取更多资源

This paper studies a statistical method to infer a species tree from a set of gene trees. The results show that the error probability decays exponentially as the number of input gene trees increases. A closed form for the error probability is derived for a four-taxon species tree, and improved upper bounds for the sample complexity are obtained.
methods seek to infer a species tree from a set of gene trees. A desirable property of such methods is that of statistical consistency; that is, the probability of inferring the wrong species tree (the error probability) tends to 0 as the number of input gene trees becomes large. A popular paradigm is to infer a species tree that agrees with the maximum number of quartets from the input set of gene trees; this has been proved to be statistically consistent under several models of gene evolution. In this paper, we study the asymptotic behaviour of the error probability of such methods in this limit, and show that it decays exponentially. For a 4-taxon species tree, we derive a closed form for the asymptotic behaviour in terms of the probability that the gene evolution process produces the correct topology. We also derive bounds for the sample complexity (the number of gene trees required to infer the true species tree with a given probability), which outperform existing bounds. We then extend our results to bounds for the asymptotic behaviour of the error probability for any species tree, and compare these to the true error probability for some model species trees using simulations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据