4.5 Article

Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today

Journal

STATISTICAL SCIENCE
Volume 16, Issue 1, Pages 23-34

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/ss/998929474

Keywords

descriptive statistics; phylogenetic tree; stochastic model; tree balance; Yule process

Ask authors/readers for more resources

In 1924 Yule observed that distributions of number of species per genus were typically long-tailed, and proposed a stochastic model to fit these data. Modern taxonomists often prefer to represent relationships between species via phylogenetic trees; the counterpart to Yule's observation is that actual reconstructed trees look surprisingly unbalanced. The imbalance can readily be seen via a scatter diagram of the sizes of clades involved in the splits of published large phylogenetic trees. Attempting stochastic modeling leads to two puzzles. First, two somewhat opposite possible biological descriptions of what dominates the macroevolutionary process (adaptive radiation; neutral evolution) lead to exactly the same mathematical model (Marhov or Yule or coalescent). Second, neither this nor any other simple stochastic model predicts the observed pattern of imbalance. This essay represents a probabilist's musings on these puzzles, complementing the more detailed survey of biological literature.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available