Article

Bayesian Selection of Nucleotide Substitution Models and Their Site Assignments

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 30, Issue 3, Pages 669-688

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/mss258

Keywords

across-site rate variation; Dirichlet process mixture model; Bayesian model selection

Funding

  1. Marsden Fund [UOA0809]
  2. Rutherford Discovery Fellowship
  3. University of Auckland Doctoral Scholarship
  4. NIH [R01 GM086887]
  5. [R01 HG006139]

Probabilistic inference of a phylogenetic tree from molecular sequence data is predicated on a substitution model describing the relative rates of change between character states along the tree for each site in the multiple sequence alignment. Commonly, one assumes that the substitution model is homogeneous across sites within large partitions of the alignment, assigns these partitions a priori, and then fixes their underlying substitution model to the best-fitting model from a hierarchy of named models. Here, we introduce an automatic model selection and model averaging approach within a Bayesian framework that simultaneously estimates the number of partitions, the assignment of sites to partitions, the substitution model for each partition, and the uncertainty in these selections. This new approach is implemented as an add-on to the BEAST 2 software platform. We find that this approach dramatically improves the fit of the nucleotide substitution model compared with existing approaches, and we show, using a number of example data sets, that as many as nine partitions are required to explain the heterogeneity in nucleotide substitution process across sites in a single gene analysis. In some instances, this improved modeling of the substitution process can have a measurable effect on downstream inference, including the estimated phylogeny, relative divergence times, and effective population size histories.
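
The keywords name a Dirichlet process mixture model as the device that lets the number of partitions be estimated rather than fixed in advance. As a rough illustration of that general idea only, and not of the BEAST 2 add-on itself, the sketch below draws site-to-category assignments from a Chinese restaurant process, one standard representation of a Dirichlet process prior. The function name `sample_crp_assignments`, the concentration parameter `alpha`, and the site count are hypothetical choices made for this example; no substitution model or likelihood is involved.

```python
import random


def sample_crp_assignments(n_sites, alpha, rng):
    """Draw category assignments for n_sites from a Chinese restaurant process.

    Illustrative prior draw only: each site joins an existing category with
    probability proportional to that category's current size, or opens a new
    category with probability proportional to alpha (assumed > 0).
    """
    assignments = []  # assignments[i] = category index of site i
    counts = []       # counts[k] = number of sites currently in category k
    for _ in range(n_sites):
        # Existing categories are weighted by size; the last slot is "new".
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new category
        else:
            counts[k] += 1     # join an existing category
        assignments.append(k)
    return assignments, counts


# Hypothetical settings: 500 alignment sites, concentration parameter 1.0.
rng = random.Random(1)
assignments, counts = sample_crp_assignments(n_sites=500, alpha=1.0, rng=rng)
print("number of categories:", len(counts))
print("category sizes:", counts)
```

Larger values of `alpha` tend to produce more categories a priori. In a full analysis such a prior would be combined with the sequence likelihood, so the data rather than the prior alone would determine how many partitions are supported.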
