4.4 Article

Local optima in K-means clustering:: What you don't know may hurt you

期刊

PSYCHOLOGICAL METHODS
卷 8, 期 3, 页码 294-304

出版社

AMER PSYCHOLOGICAL ASSOC
DOI: 10.1037/1082-989X.8.3.294

关键词

-

向作者/读者索取更多资源

The popular K-means clustering method, as implemented in 3 commercial software packages (SPSS, SYSTAT, and SAS), generally provides solutions that are only locally optimal for a given set of data. Because none of these commercial implementations offer a reasonable mechanism to begin the K-means method at alternative starting points, separate routines were written within the MATLAB (MathWorks, 1999) environment that can be initialized randomly (these routines are provided at the end of the online version of this article in the PsycARTICLES database). Through the analysis of 2 empirical data sets and 8 10 simulated data sets, it is shown that the results provided by commercial packages are most likely locally optimal. These results suggest the need for some strategy to study the local optima problem for a specific data set or to identify methods for finding good starting values that might lead to the best solutions possible.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据