4.5 Article

A maximum likelihood approximation method for Dirichlet's parameter estimation

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS
卷 52, 期 3, 页码 1315-1322

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.csda.2007.07.011

关键词

Dirichlet distribution; maximum likelihood; parameter estimation; proteins clustering

向作者/读者索取更多资源

Dirichlet distributions are natural choices to analyse data described by frequencies or proportions since they are the simplest known distributions for such data apart from the uniform distribution. They are often used whenever proportions are involved, for example, in text-mining, image analysis, biology or as a prior of a multinomial distribution in Bayesian statistics. As the Dirichlet distribution belongs to the exponential family, its parameters can be easily inferred by maximum likelihood. Parameter estimation is usually performed with the Newton-Raphson algorithm after an initialisation step using either the moments or Ronning's methods. However this initialisation can result in parameters that lie outside the admissible region. A simple and very efficient alternative based on a maximum likelihood approximation is presented. The advantages of the presented method compared to two other methods are demonstrated on synthetic data sets as well as for a practical biological problem: the clustering of protein sequences based on their amino acid compositions. (c) 2007 Elsevier B.V All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据