4.6 Article

The Occurrence Birth-Death Process for Combined-Evidence Analysis in Macroevolution and Epidemiology

Journal

SYSTEMATIC BIOLOGY
Volume 71, Issue 6, Pages 1440-1452

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/sysbio/syac037

Keywords

-

Funding

  1. Eidgenossische Technische Hochschule Zurich

Ask authors/readers for more resources

Phylodynamic models aim to infer phylogenetic relationships, model parameters, and the number of lineages over time based on molecular sequence data. Recent developments include the integration of molecular and morphological data in a unified Bayesian inference framework, as well as new methodological developments in incorporating occurrence data and estimating lineage numbers. This research has practical applications in epidemiology and macroevolution, and has the potential to bridge the gap between traditional epidemiology and pathogen genomics, as well as paleontology and molecular phylogenetics.
Phylodynamic models generally aim at jointly inferring phylogenetic relationships, model parameters, and more recently, the number of lineages through time, based on molecular sequence data. In the fields of epidemiology and macroevolution, these models can be used to estimate, respectively, the past number of infected individuals (prevalence) or the past number of species (paleodiversity) through time. Recent years have seen the development of total-evidence analyses, which combine molecular and morphological data from extant and past sampled individuals in a unified Bayesian inference framework. Even sampled individuals characterized only by their sampling time, that is, lacking morphological and molecular data, which we call occurrences, provide invaluable information to estimate the past number of lineages. Here, we present new methodological developments around the fossilized birth-death process enabling us to (i) incorporate occurrence data in the likelihood function; (ii) consider piecewise-constant birth, death, and sampling rates; and (iii) estimate the past number of lineages, with or without knowledge of the underlying tree. We implement our method in the RevBayes software environment, enabling its use along with a large set of models of molecular and morphological evolution, and validate the inference workflow using simulations under a wide range of conditions. We finally illustrate our new implementation using two empirical data sets stemming from the fields of epidemiology and macroevolution. In epidemiology, we infer the prevalence of the coronavirus disease 2019 outbreak on the Diamond Princess ship, by taking into account jointly the case count record (occurrences) along with viral sequences for a fraction of infected individuals. In macroevolution, we infer the diversity trajectory of cetaceans using molecular and morphological data from extant taxa, morphological data from fossils, as well as numerous fossil occurrences. The joint modeling of occurrences and trees holds the promise to further bridge the gap between traditional epidemiology and pathogen genomics, as well as paleontology and molecular phylogenetics. [Birth-death model; epidemiology; fossils; macroevolution; occurrences; phylogenetics; skyline.]

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available