☆ 4.6 Review

Model-based clustering, discriminant analysis, and density estimation

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2002)

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Volume 97, Issue 458, Pages 611-631

Publisher

TAYLOR & FRANCIS INC

DOI: 10.1198/016214502760047131

Keywords

Bayes factor; breast cancer diagnosis; cluster analysis; EM algorithm; gene expression microarray data; Markov chain Monte Carlo; mixture model; outliers; spatial point process

Abstract

Cluster analysis is the automated search for groups of related observations in a dataset. Most clustering done in practice is based largely on heuristic but intuitively reasonable procedures, and most clustering methods available in commercial software are also of this type. However, there is little systematic guidance associated with these methods for solving important practical questions that arise in cluster analysis, such as how many clusters are there, which clustering method should be used, and how should outliers be handled. We review a general methodology for model-based clustering that provides a principled statistical approach to these issues. We also show that this can be useful for other problems in multivariate analysis, such as discriminant analysis and multivariate density estimation. We give examples from medical diagnosis, minefield detection, cluster recovery from noisy data, and spatial density estimation. Finally, we mention limitations of the methodology and discuss recent developments in model-based clustering for non-Gaussian data, high-dimensional datasets, large datasets, and Bayesian estimation.
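As a rough illustration of how a model-based approach can address the "how many clusters" question, the sketch below fits one-dimensional Gaussian mixtures by the EM algorithm and compares them with BIC, used here as a common approximation to Bayes factors between models. It is not code from the paper. The function names (`fit_gmm_1d`, `bic`), the quantile-based initialization, the variance floor, and the synthetic data are all assumptions made for this example, and the reviewed methodology covers multivariate models with structured covariances that this toy does not attempt.

```python
import math
import random


def _component_logs(x, weights, means, variances):
    """Log of weight * Gaussian density for each mixture component at x."""
    return [
        math.log(w) - 0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
        for w, m, v in zip(weights, means, variances)
    ]


def _log_sum_exp(values):
    """Numerically stable log(sum(exp(values)))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def fit_gmm_1d(data, k, n_iter=200, var_floor=1e-3):
    """Fit a k-component 1-D Gaussian mixture by EM (hypothetical helper)."""
    n = len(data)
    xs = sorted(data)
    grand_mean = sum(xs) / n
    grand_var = sum((x - grand_mean) ** 2 for x in xs) / n
    # Assumed deterministic start: means at evenly spaced quantiles.
    means = [xs[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    variances = [max(grand_var, var_floor)] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in data:
            logs = _component_logs(x, weights, means, variances)
            total = _log_sum_exp(logs)
            resp.append([math.exp(v - total) for v in logs])
        # M-step: weighted updates of proportions, means, and variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, var_floor)  # guard against collapse
    return weights, means, variances


def bic(data, k):
    """BIC = 2 log L - p log n, so larger values indicate a preferred model."""
    weights, means, variances = fit_gmm_1d(data, k)
    loglik = sum(
        _log_sum_exp(_component_logs(x, weights, means, variances)) for x in data
    )
    n_params = 3 * k - 1  # (k - 1) proportions + k means + k variances
    return 2.0 * loglik - n_params * math.log(len(data))


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic data: two well-separated groups (an assumption for the demo).
    sample = [rng.gauss(0.0, 1.0) for _ in range(100)]
    sample += [rng.gauss(10.0, 1.0) for _ in range(100)]
    for k in (1, 2, 3):
        print(k, round(bic(sample, k), 1))
```

Running the script prints one BIC value per candidate number of components. Under the sign convention used here, the candidate with the largest value would be the one selected.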
