4.8 Article

SpeciesRax: A Tool for Maximum Likelihood Species Tree Inference from Gene Family Trees under Duplication, Transfer, and Loss

期刊

MOLECULAR BIOLOGY AND EVOLUTION
卷 39, 期 2, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msab365

关键词

species tree inference; gene family tree; maximum likelihood; gene duplication; horizontal gene transfer; gene loss

资金

  1. Klaus Tschira Foundation
  2. DFG [STA 860/6-2]
  3. European Research Council under the European Union [714774, GINOP-2.3.2, -15-2016-00057]
  4. Royal Society University Fellowship
  5. NERC [NE/P00251X/1]
  6. Gordon and Betty Moore Foundation [GBMF9741]
  7. European Research Council (ERC) [714774] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

SpeciesRax is a maximum likelihood method that can infer a rooted species tree from a set of gene family trees and can account for gene duplication, loss, and transfer events. It leverages the phylogenetic rooting signal in gene trees and infers species tree branch lengths and support values through paralogy-aware quartets extracted from the gene family trees. It is faster and at least as accurate as the best competing methods.
Species tree inference from gene family trees is becoming increasingly popular because it can account for discordance between the species tree and the corresponding gene family trees. In particular, methods that can account for multiple-copy gene families exhibit potential to leverage paralogy as informative signal. At present, there does not exist any widely adopted inference method for this purpose. Here, we present SpeciesRax, the first maximum likelihood method that can infer a rooted species tree from a set of gene family trees and can account for gene duplication, loss, and transfer events. By explicitly modeling events by which gene trees can depart from the species tree, SpeciesRax leverages the phylogenetic rooting signal in gene trees. SpeciesRax infers species tree branch lengths in units of expected substitutions per site and branch support values via paralogy-aware quartets extracted from the gene family trees. Using both empirical and simulated data sets we show that SpeciesRax is at least as accurate as the best competing methods while being one order of magnitude faster on large data sets at the same time. We used SpeciesRax to infer a biologically plausible rooted phylogeny of the vertebrates comprising 188 species from 31,612 gene families in 1 h using 40 cores. SpeciesRax is available under GNU GPL at https://github.com/BenoitMorel/GeneRax and on BioCanda.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据