4.7 Article

A time-efficient pattern reduction algorithm for k-means clustering

Journal

INFORMATION SCIENCES
Volume 181, Issue 4, Pages 716-731

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2010.10.008

Keywords

Data clustering; k-means; Pattern reduction

Funding

  1. National Science Council, Taiwan, ROC [NSC99-2221-E-110-052, NSC98-2811-E-006-078]

Ask authors/readers for more resources

This paper presents an efficient algorithm, called pattern reduction (PR), for reducing the computation time of k-means and k-means-based clustering algorithms. The proposed algorithm works by compressing and removing at each iteration patterns that are unlikely to change their membership thereafter. Not only is the proposed algorithm simple and easy to implement, but it can also be applied to many other iterative clustering algorithms such as kernel-based and population-based clustering algorithms. Our experiments-from 2 to 1000 dimensions and 150 to 10,000,000 patterns-indicate that with a small loss of quality, the proposed algorithm can significantly reduce the computation time of all state-of-the-art clustering algorithms evaluated in this paper, especially for large and high-dimensional data sets. (C) 2010 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available