On the Inference of Dirichlet Mixture Priors for Protein Sequence Comparison

JOURNAL OF COMPUTATIONAL BIOLOGY (2011)

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY

Volume 18, Issue 8, Pages 941-954

Publisher

MARY ANN LIEBERT INC

DOI: 10.1089/cmb.2011.0040

Keywords

algorithms; combinatorics; linear programming; machine learning; statistics

Funding

National Library of Medicine at the National Institutes of Health

Abstract

Dirichlet mixtures provide an elegant formalism for constructing and evaluating protein multiple sequence alignments. Their use requires the inference of Dirichlet mixture priors from curated sets of accurately aligned sequences. This article addresses two questions relevant to such inference: of how many components should a Dirichlet mixture consist, and how may a maximum-likelihood mixture be derived from a given data set. To apply the Minimum Description Length principle to the first question, we extend an analytic formula for the complexity of a Dirichlet model to Dirichlet mixtures by informal argument. We apply a Gibbs-sampling based approach to the second question. Using artificial data generated by a Dirichlet mixture, we demonstrate that our methods are able to approximate well the true theory, when it exists. We apply our methods as well to real data, and infer Dirichlet mixtures that describe the data better than does a mixture derived using previous approaches.
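As a rough illustration of the quantity that such inference seeks to maximize, the sketch below scores one alignment column (a vector of amino acid counts) under a Dirichlet mixture, using the standard Dirichlet-multinomial form. The function names, the toy two-letter alphabet, and the choice to include the multinomial coefficient are assumptions made for illustration; they are not taken from the article, and neither the MDL complexity term nor the Gibbs sampler is reproduced here.

```python
import math


def log_dirichlet_multinomial(counts, alpha):
    """Log-probability of a count vector under a single Dirichlet component.

    Assumes the unordered (multinomial) form:
    P(n | alpha) = N!/prod(n_i!) * G(A)/G(A+N) * prod(G(alpha_i+n_i)/G(alpha_i)),
    where N = sum(n), A = sum(alpha), and G is the gamma function.
    """
    n_total = sum(counts)
    a_total = sum(alpha)
    # Multinomial coefficient.
    log_p = math.lgamma(n_total + 1) - sum(math.lgamma(c + 1) for c in counts)
    # Normalizing ratio of the Dirichlet before and after seeing the counts.
    log_p += math.lgamma(a_total) - math.lgamma(a_total + n_total)
    # Per-letter terms.
    log_p += sum(math.lgamma(a + c) - math.lgamma(a) for a, c in zip(alpha, counts))
    return log_p


def log_mixture_likelihood(counts, weights, alphas):
    """Log-probability of a count vector under a Dirichlet mixture.

    weights: mixture coefficients, assumed positive and summing to 1.
    alphas:  one Dirichlet parameter vector per component.
    """
    terms = [
        math.log(w) + log_dirichlet_multinomial(counts, a)
        for w, a in zip(weights, alphas)
    ]
    # Log-sum-exp for numerical stability.
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


def log_data_likelihood(columns, weights, alphas):
    """Sum of per-column log-likelihoods, treating columns as independent."""
    return sum(log_mixture_likelihood(c, weights, alphas) for c in columns)
```

Under these assumptions, comparing `log_data_likelihood` across candidate mixtures fitted to the same columns gives a raw measure of fit; a model-selection criterion such as the one discussed in the abstract would additionally penalize the number of components.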
