4.7 Article

A time-efficient pattern reduction algorithm for k-means clustering

期刊

INFORMATION SCIENCES
卷 181, 期 4, 页码 716-731

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2010.10.008

关键词

Data clustering; k-means; Pattern reduction

资金

  1. National Science Council, Taiwan, ROC [NSC99-2221-E-110-052, NSC98-2811-E-006-078]

向作者/读者索取更多资源

This paper presents an efficient algorithm, called pattern reduction (PR), for reducing the computation time of k-means and k-means-based clustering algorithms. The proposed algorithm works by compressing and removing at each iteration patterns that are unlikely to change their membership thereafter. Not only is the proposed algorithm simple and easy to implement, but it can also be applied to many other iterative clustering algorithms such as kernel-based and population-based clustering algorithms. Our experiments-from 2 to 1000 dimensions and 150 to 10,000,000 patterns-indicate that with a small loss of quality, the proposed algorithm can significantly reduce the computation time of all state-of-the-art clustering algorithms evaluated in this paper, especially for large and high-dimensional data sets. (C) 2010 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据