☆ 4.6 Article

Pitfalls of heterogeneous processes for phylogenetic reconstruction

SYSTEMATIC BIOLOGY (2007)

Journal

SYSTEMATIC BIOLOGY

Volume 56, Issue 1, Pages 113-124

Publisher

TAYLOR & FRANCIS INC

DOI: 10.1080/10635150701245388

Keywords

inconsistency of likelihood; linear invariants; Markov chain; mixture models; Monte Carlo; non-identifiability; phylogenetic invariants; phyogenetics; rate variation; tree identifiability

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Different genes often have different phylogenetic histories. Even within regions having the same phylogenetic history, the mutation rates often vary. We investigate the prospects of phylogenetic reconstruction when all the characters are generated from the same tree topology, but the branch lengths vary (with possibly different tree shapes). Furthering work of Kolaczkowski and Thornton (2004, Nature 431: 980-984) and Chang (1996, Math. Biosci. 134: 189-216), we show examples where maximum likelihood (under a homogeneous model) is an inconsistent estimator of the tree. We then explore the prospects of phylogenetic inference under a heterogeneous model. In some models, there are examples where phylogenetic inference under any method is impossible-despite the fact that there is a common tree topology. In particular, there are nonidentifiable mixture distributions, i.e., multiple topologies generate identical mixture distributions. We address which evolutionary models have nonidentifiable mixture distributions and prove that the following duality theorem holds for most DNA substitution models. The model has either: (i) nonidentifiability-two different tree topologies can produce identical mixture distributions, and hence distinguishing between the two topologies is impossible; or (ii) linear tests-there exist linear tests which identify the common tree topology for character data generated by a mixture distribution. The theorem holds for models whose transition matrices can be parameterized by open sets, which includes most of the popular models, such as Tamura-Nei and Kimura's 2-parameter model. The duality theorem relies on our notion of linear tests, which are related to Lake's linear invariants.

Pitfalls of heterogeneous processes for phylogenetic reconstruction

Journal

SYSTEMATIC BIOLOGY

Publisher

TAYLOR & FRANCIS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Pitfalls of heterogeneous processes for phylogenetic reconstruction

Journal

SYSTEMATIC BIOLOGY

Publisher

TAYLOR & FRANCIS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper