On the Inference of Dirichlet Mixture Priors for Protein Sequence Comparison

JOURNAL OF COMPUTATIONAL BIOLOGY (2011)

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY

Volume 18, Issue 8, Pages 941-954

Publisher

MARY ANN LIEBERT INC

DOI: 10.1089/cmb.2011.0040

Keywords

algorithms; combinatorics; linear programming; machine learning; statistics

Categories

Biochemical Research Methods; Biotechnology & Applied Microbiology; Computer Science, Interdisciplinary Applications; Mathematical & Computational Biology; Statistics & Probability

Funding

National Library of Medicine at the National Institutes of Health

Abstract

Dirichlet mixtures provide an elegant formalism for constructing and evaluating protein multiple sequence alignments. Their use requires the inference of Dirichlet mixture priors from curated sets of accurately aligned sequences. This article addresses two questions relevant to such inference: of how many components should a Dirichlet mixture consist, and how may a maximum-likelihood mixture be derived from a given data set. To apply the Minimum Description Length principle to the first question, we extend an analytic formula for the complexity of a Dirichlet model to Dirichlet mixtures by informal argument. We apply a Gibbs-sampling based approach to the second question. Using artificial data generated by a Dirichlet mixture, we demonstrate that our methods are able to approximate well the true theory, when it exists. We apply our methods as well to real data, and infer Dirichlet mixtures that describe the data better than does a mixture derived using previous approaches.
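To make the formalism concrete, the following is a minimal sketch of how a Dirichlet mixture prior is typically used to score one alignment column and to estimate residue frequencies from its observed counts. It is a standard textbook calculation, not the inference procedure of the article: the function names, the three-letter alphabet, and the parameter values are all hypothetical, chosen only for illustration.

```python
import math


def log_dirichlet_marginal(counts, alpha):
    """Log probability of one ordered column with the given residue counts,
    integrated over a single Dirichlet component with parameters alpha.
    (The multinomial coefficient is omitted; it is the same for every component.)"""
    n_total = sum(counts)
    a_total = sum(alpha)
    result = math.lgamma(a_total) - math.lgamma(a_total + n_total)
    for n_i, a_i in zip(counts, alpha):
        result += math.lgamma(a_i + n_i) - math.lgamma(a_i)
    return result


def component_posteriors(counts, weights, alphas):
    """Posterior probability of each mixture component given the counts."""
    logs = [math.log(w) + log_dirichlet_marginal(counts, a)
            for w, a in zip(weights, alphas)]
    shift = max(logs)  # subtract the maximum for numerical stability
    exps = [math.exp(x - shift) for x in logs]
    total = sum(exps)
    return [e / total for e in exps]


def posterior_mean_frequencies(counts, weights, alphas):
    """Posterior mean residue frequencies under the mixture prior."""
    post = component_posteriors(counts, weights, alphas)
    n_total = sum(counts)
    estimate = [0.0] * len(counts)
    for w, alpha in zip(post, alphas):
        a_total = sum(alpha)
        for i, (n_i, a_i) in enumerate(zip(counts, alpha)):
            estimate[i] += w * (n_i + a_i) / (n_total + a_total)
    return estimate


# Hypothetical two-component mixture over a toy three-letter alphabet.
weights = [0.6, 0.4]
alphas = [[5.0, 0.5, 0.5], [0.5, 0.5, 5.0]]
counts = [4, 0, 0]  # a column in which the first letter was seen four times

print(component_posteriors(counts, weights, alphas))
print(posterior_mean_frequencies(counts, weights, alphas))
```

For protein data the alphabet would have 20 letters, and the mixture weights and Dirichlet parameters would be the quantities inferred from curated alignments, which is the problem the article addresses.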
