4.8 Article

Improving the Accuracy of Demographic and Molecular Clock Model Comparison While Accommodating Phylogenetic Uncertainty

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 29, Issue 9, Pages 2157-2167

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/mss084

Keywords

model comparison; marginal likelihood; Bayes factors; path sampling; stepping-stone sampling; demographic models; molecular clock; Bayesian inference; phylogeny; BEAST

Funding

  1. European Union [278433]
  2. ERC [260864]
  3. National Institutes of Health [U54 RR024386-01A2, R01 GM086887, R01 HG006139, R01 NS063897]
  4. Wellcome Trust [WT092807MA]
  5. European Molecular Biology Organization
  6. National Evolutionary Synthesis Center through a working group (Software for Bayesian Evolutionary Analysis)

Ask authors/readers for more resources

Recent developments in marginal likelihood estimation for model selection in the field of Bayesian phylogenetics and molecular evolution have emphasized the poor performance of the harmonic mean estimator (HME). Although these studies have shown the merits of new approaches applied to standard normally distributed examples and small real-world data sets, not much is currently known concerning the performance and computational issues of these methods when fitting complex evolutionary and population genetic models to empirical real-world data sets. Further, these approaches have not yet seen widespread application in the field due to the lack of implementations of these computationally demanding techniques in commonly used phylogenetic packages. We here investigate the performance of some of these new marginal likelihood estimators, specifically, path sampling (PS) and stepping-stone (SS) sampling for comparing models of demographic change and relaxed molecular clocks, using synthetic data and real-world examples for which unexpected inferences were made using the HME. Given the drastically increased computational demands of PS and SS sampling, we also investigate a posterior simulation-based analogue of Akaike's information criterion (AIC) through Markov chain Monte Carlo (MCMC), a model comparison approach that shares with the HME the appealing feature of having a low computational overhead over the original MCMC analysis. We confirm that the HME systematically overestimates the marginal likelihood and fails to yield reliable model classification and show that the AICM performs better and may be a useful initial evaluation of model choice but that it is also, to a lesser degree, unreliable. We show that PS and SS sampling substantially outperform these estimators and adjust the conclusions made concerning previous analyses for the three real-world data sets that we reanalyzed. The methods used in this article are now available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available