Article

Initializing k-means Clustering by Bootstrap and Data Depth

Journal

JOURNAL OF CLASSIFICATION
Volume 38, Issue 2, Pages 232-256

Publisher

SPRINGER
DOI: 10.1007/s00357-020-09372-3

Keywords

k-means algorithm; Bootstrap; MBD data depth

The paper introduces a simple, computationally feasible method for choosing initial seeds for the k-means clustering algorithm, addressing its sensitivity to initial conditions. In tests on simulated and real data, the method is shown to be feasible and, in many situations, more efficient than commonly used initialization algorithms.
The k-means algorithm is widely used in various research fields because of its fast convergence to the cost function minima; however, it frequently gets stuck in local optima because it is sensitive to initial conditions. This paper explores a simple, computationally feasible method that provides k-means with a set of initial seeds to cluster datasets of arbitrary dimensions. Our technique consists of two stages: first, we use the original data space to obtain a set of prototypes (cluster centers) by applying k-means to bootstrap replications of the data; second, we cluster the space of centers, which has tighter (and thus easier to separate) groups, and search for the deepest point in each assembled cluster using a depth notion. We test this method with simulated and real data, compare it with commonly used k-means initialization algorithms, and show that it is feasible and more efficient than previous proposals in many situations.
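
The two stages described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names (`kmeans`, `modified_band_depth`, `bootstrap_depth_seeds`), the number of bootstrap replications `n_boot`, the plain Lloyd iteration with random starts, and the choice of a modified band depth (MBD) computed over pairs of points with each coordinate treated as one evaluation point are all assumptions made for the example.

```python
import numpy as np


def kmeans(X, k, rng, n_iter=50):
    """Plain Lloyd's algorithm, started from k distinct random rows of X."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # An empty cluster keeps its previous center (a simplification).
        new = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new, centers):
            break
        centers = new
    # Recompute labels so they always match the returned centers.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return centers, d2.argmin(axis=1)


def modified_band_depth(P):
    """MBD of each row of P with respect to all rows of P (bands from pairs).

    For each coordinate, a pair of points fails to cover a value only when
    both lie strictly below it or both lie strictly above it.
    """
    n = len(P)
    if n < 2:
        return np.ones(n)
    below = (P[None, :, :] < P[:, None, :]).sum(axis=1)  # shape (n, d)
    above = (P[None, :, :] > P[:, None, :]).sum(axis=1)  # shape (n, d)
    n_pairs = n * (n - 1) / 2
    outside = below * (below - 1) / 2 + above * (above - 1) / 2
    return (1.0 - outside / n_pairs).mean(axis=1)


def bootstrap_depth_seeds(X, k, n_boot=20, seed=0):
    """Stage 1: k-means on bootstrap samples. Stage 2: deepest center per group."""
    rng = np.random.default_rng(seed)
    centers = []
    for _ in range(n_boot):
        sample = X[rng.integers(0, len(X), size=len(X))]  # resample with replacement
        c, _ = kmeans(sample, k, rng)
        centers.append(c)
    centers = np.vstack(centers)            # n_boot * k prototypes
    _, labels = kmeans(centers, k, rng)     # cluster the space of centers
    seeds = []
    for j in range(k):
        group = centers[labels == j]
        if len(group) == 0:                 # skip a group that ended up empty
            continue
        seeds.append(group[modified_band_depth(group).argmax()])
    return np.array(seeds)
```

The returned seeds would then be passed to a final k-means run on the original data in place of random starting centers. A rank-based count is used for the depth so that each group costs O(n^2 d) memory and time, which is acceptable here because the groups contain only bootstrap centers rather than the full dataset.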
