4.5 Article

Fast Bayesian Inference in Dirichlet Process Mixture Models

Journal

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS
Volume 20, Issue 1, Pages 196-216

Publisher

AMER STATISTICAL ASSOC
DOI: 10.1198/jcgs.2010.07081

Keywords

Clustering; Density estimation; Efficient computation; Large samples; Nonparametric Bayes; Polya urn scheme; Sequential analysis

Funding

  1. NIEHS NIH HHS [R01 ES017240, R01 ES017436] Funding Source: Medline

Ask authors/readers for more resources

There has been increasing interest in applying Bayesian nonparametric methods in large samples and high dimensions. As Markov chain Monte Carlo (MCMC) algorithms are often infeasible, there is a pressing need for much faster algorithms. This article proposes a fast approach for inference in Dirichlet process mixture (DPM) models. Viewing the partitioning of subjects into clusters as a model selection problem, we propose a sequential greedy search algorithm for selecting the partition. Then, when conjugate priors are chosen, the resulting posterior conditionally on the selected partition is available in closed form. This approach allows testing of parametric models versus nonparametric alternatives based on Bayes factors. We evaluate the approach using simulation studies and compare it with four other fast nonparametric methods in the literature. We apply the proposed approach to three datasets including one from a large epidemiologic study. Matlab codes for the simulation and data analyses using the proposed approach are available online in the supplemental materials.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available