4.6 Article Proceedings Paper

Cluster identification using projections

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 96, 期 456, 页码 1433-1445

出版社

AMER STATISTICAL ASSOC
DOI: 10.1198/016214501753382345

关键词

classification; kurtosis; multivariate analysis; robustness; spacings

向作者/读者索取更多资源

This article describes a procedure to identify clusters in multivariate data using information obtained from the univariate projections of the sample data onto certain directions. The directions are chosen as those that minimize and maximize the kurtosis coefficient of the projected data. It is shown that, under certain conditions, these directions provide the largest separation for the different clusters. The projected univariate data are used to group the observations according to the values of the gaps or spacings between consecutive-ordered observations. These groupings are then combined over all projection directions. The behavior of the method is tested on several examples, and compared to k-means, MCLUST, and the procedure proposed by Jones and Sibson in 1987. The proposed algorithm is iterative, affine equivariant, flexible, robust to outliers, fast to implement, and seems to work well in practice.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据