4.8 Article

A simple hierarchical approach to modeling distributions of substitution rates

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 22, Issue 2, Pages 223-234

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msi009

Keywords

substitution rates; hierarchical model; adaptive evolution; hepatitis C; model selection; parallel algorithms

Ask authors/readers for more resources

Genetic sequence data typically exhibit variability in substitution rates across sites. In practice. there is often too hale, variation to fit a different rate for each site in the alignment. but the distribution of rates across sites may not be well modeled using simple parametric families. Mixtures of different distributions can capture more complex patterns of rate variation, but are often parameter-rich and difficult to fit. We present a simple hierarchical model in which a baseline rate distribution, such as a gamma distribution. is discretized into several categories, the quantiles of which are estimated using a discretized beta distribution. Although this approach involves adding only two extra parameters to a standard distribution, a wide range of rate distributions can be captured. Using simulated data, we demonstrate that a beta- model can reproduce the moments of the rate distribution more accurately than the distribution used to simulate the data. even when the baseline rate distribution is misspecified. Using hepatitis C virus and mammalian mitochondrial sequences, we show that a beta-model can fit as well or better than a model with multiple discrete rate categories. and compares favorably with a model which fits a separate rate category to each site. We also demonstrate this discretization scheme in the context of codon models specifically aimed at identifying individual sites undergoing adaptive or purifying evolution.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available