Article

A time-efficient pattern reduction algorithm for k-means clustering

INFORMATION SCIENCES (2011)

Journal

INFORMATION SCIENCES

Volume 181, Issue 4, Pages 716-731

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2010.10.008

Keywords

Data clustering; k-means; Pattern reduction

Category

Computer Science, Information Systems

Funding

National Science Council, Taiwan, ROC [NSC99-2221-E-110-052, NSC98-2811-E-006-078]

Abstract

This paper presents an efficient algorithm, called pattern reduction (PR), for reducing the computation time of k-means and k-means-based clustering algorithms. The proposed algorithm works by compressing and removing, at each iteration, patterns that are unlikely to change their membership thereafter. Not only is the proposed algorithm simple and easy to implement, but it can also be applied to many other iterative clustering algorithms such as kernel-based and population-based clustering algorithms. Our experiments (from 2 to 1000 dimensions and 150 to 10,000,000 patterns) indicate that with a small loss of quality, the proposed algorithm can significantly reduce the computation time of all state-of-the-art clustering algorithms evaluated in this paper, especially for large and high-dimensional data sets. © 2010 Elsevier Inc. All rights reserved.
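The abstract does not say how the paper decides that a pattern is "unlikely to change" its membership, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical rule: a pattern is treated as static once it has kept the same label for `stable_after` consecutive iterations. Static patterns are compressed into a per-cluster coordinate sum and count, then removed from the assignment step. The function name `kmeans_pr` and all of its parameters are illustrative.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pr(points, init_centroids, iters=20, stable_after=2):
    """k-means with a simplified pattern-reduction step.

    Assumption (not taken from the paper): a pattern is considered unlikely
    to change membership once its label has stayed the same for
    `stable_after` consecutive iterations.
    """
    k = len(init_centroids)
    dim = len(points[0])
    centroids = [list(c) for c in init_centroids]
    labels = [-1] * len(points)        # label of every original pattern
    streak = [0] * len(points)         # consecutive iterations with same label
    active = list(range(len(points)))  # patterns still being reassigned
    # Compressed (removed) patterns, kept as one sum and one count per cluster.
    frozen_sum = [[0.0] * dim for _ in range(k)]
    frozen_cnt = [0] * k

    for _ in range(iters):
        if not active:
            break
        # Assignment step: distances are computed for active patterns only.
        changed = False
        for i in active:
            best = min(range(k), key=lambda c: sq_dist(points[i], centroids[c]))
            if best == labels[i]:
                streak[i] += 1
            else:
                labels[i], streak[i], changed = best, 1, True
        # Update step: compressed patterns contribute through their sums.
        sums = [row[:] for row in frozen_sum]
        cnts = frozen_cnt[:]
        for i in active:
            c = labels[i]
            cnts[c] += 1
            for d in range(dim):
                sums[c][d] += points[i][d]
        for c in range(k):
            if cnts[c]:
                centroids[c] = [s / cnts[c] for s in sums[c]]
        # Reduction step: compress and remove patterns that look static.
        still_active = []
        for i in active:
            if streak[i] >= stable_after:
                c = labels[i]
                frozen_cnt[c] += 1
                for d in range(dim):
                    frozen_sum[c][d] += points[i][d]
            else:
                still_active.append(i)
        active = still_active
        if not changed:
            break
    return centroids, labels


# Usage on a toy data set with two well-separated groups.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, labels = kmeans_pr(data, init_centroids=[(0, 0), (10, 10)])
print(labels, centroids)
```

Because removed patterns never get their distances recomputed, the per-iteration cost shrinks as the active set shrinks. The trade-off is that a pattern frozen too early keeps its label even if the centroids later move, which is consistent with the "small loss of quality" the abstract mentions.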
