4.7 Article

An efficient k-means clustering filtering algorithm using density based initial cluster centers

Journal

INFORMATION SCIENCES
Volume 418, Issue -, Pages 286-301

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2017.07.036

Keywords

k-means clustering; kd-tree; Initial cluster centers; Knowledge discovery

Funding

  1. Department of Higher Education, Ministry of Human Resource Development (MHRD), Govt. of India, under Technical Education Quality Improvement Program [TEQIP-II-1.2]

Ask authors/readers for more resources

k-means is a preeminent partitional based clustering method that finds k clusters from the given dataset by computing distances from each point to k cluster centers iteratively. The filtering algorithm improves the performance of k-means by imposing an index structure on the dataset and reduces the number of cluster centers searched while finding the nearest center of a point. The performance of filtering algorithm is influenced by the degree of separation between initial cluster centers. In this paper, we propose an efficient initial seed selection method, RDBI, to improve the performance of k-means filtering method by locating the seed points at dense areas of the dataset and well separated. The dense areas are identified by representing the data points in a kd-tree. A comprehensive experimental analysis is performed to evaluate the performance efficiency of proposed method against state-of-the-art initialization methods and shown that the proposed method is efficient in terms of both running time and clustering accuracy. (C) 2017 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available