4.4 Article

Identifiability of species network topologies from genomic sequences using the logDet distance

期刊

JOURNAL OF MATHEMATICAL BIOLOGY
卷 84, 期 5, 页码 -

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s00285-022-01734-2

关键词

Species network; Identifiability; LogDet; Phylogenetic inference

资金

  1. National Institutes of Health [R01 GM117590]
  2. National Institutes of Health, an NIGMS Institutional Development Award (IDeA) [2P20GM103395]
  3. National Science Foundation [2051760]
  4. Moore-Simons Project on the Origin of the Eukaryotic Cell, Simons Foundation [735923LPI]
  5. Direct For Mathematical & Physical Scien
  6. Division Of Mathematical Sciences [2051760] Funding Source: National Science Foundation

向作者/读者索取更多资源

Inference of network-like evolutionary relationships between species from genomic data must consider both gene flow and incomplete lineage sorting. Standard methods have high computational demands and limit the size of analyzed datasets. This study shows that logDet distances computed from genomic-scale sequences can efficiently recover network relationships in the level-1 ultrametric case. It applies to both unlinked site data and sequence data.
Inference of network-like evolutionary relationships between species from genomic data must address the interwoven signals from both gene flow and incomplete lineage sorting. The heavy computational demands of standard approaches to this problem severely limit the size of datasets that may be analyzed, in both the number of species and the number of genetic loci. Here we provide a theoretical pointer to more efficient methods, by showing that logDet distances computed from genomic-scale sequences retain sufficient information to recover network relationships in the level-1 ultrametric case. This result is obtained under the Network Multispecies Coalescent model combined with a mixture of General Time-Reversible sequence evolution models across individual gene trees. It applies to both unlinked site data, such as for SNPs, and to sequence data in which many contiguous sites may have evolved on a common tree, such as concatenated gene sequences. Thus under standard stochastic models statistically justifiable inference of network relationships from sequences can be accomplished without consideration of individual genes or gene trees.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据