4.7 Article

ROBP a robust border-peeling clustering using Cauchy kernel

Journal

INFORMATION SCIENCES
Volume 571, Issue -, Pages 375-400

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.04.089

Keywords

Density-based clustering; Border-peeling clustering; Cauchy kernel

Funding

  1. National Natural Science Foundation of China [62006104, 61872168]
  2. Nat-ural Science Foundation of the Jiangsu Higher Education Institutions [20KJB520012]
  3. Scientific Research Founda-tion for Young Teachers by Jiangsu Normal University [18XLRX006]

Ask authors/readers for more resources

The border-peeling (BP) clustering algorithm is effective in recognizing cluster structures and detecting outliers, but may perform poorly on datasets with non-uniformly distributed clusters and complex shapes. To address these issues, a robust border-peeling clustering algorithm (ROBP) is proposed in this paper, which outperforms competitors in most cases. Experiment results demonstrate that ROBP is more robust, reliable, and computationally efficient compared to BP.
Recently a novel density-based clustering algorithm, namely, border-peeling (BP) clustering algorithm, is proposed to group data by iteratively identifying border points and peeling off them until separable areas of data remain. The BP clustering is able to correctly recognize the true structure of clusters and automatically detect the outliers on several test cases. However, there are some drawbacks in BP, and these may hinder its widespread application. The BP clustering might yield bad results on datasets with non-uniformly distributed clusters. Especially, the BP clustering tends to over-partition the data with complex shape. To overcome these defects, a robust border-peeling clustering algorithm (named as ROBP) is proposed in this paper. Our method improves the BP clustering algorithm from two aspects: density influence (i.e. density estimation) and linkage criterion (i.e. association strategy). In density estimation, we use Cauchy kernel with longer tails instead of Gaussian kernel in the local scaling function, and further propose a kernel density estimator, i.e., the density estimator based on Cauchy kernel. It can calculate quickly and accurately the density influence value of each point. In association strategy, we design a linkage criterion based on the shared neighborhood information. The linkage criterion can create some links between peeled border points and their neighboring peeled border points, in order to avoid over-segmentation of the clusters. We integrate the proposed linkage criterion and the uni-directional association strategy, and further propose a bidirectional association strategy. In experiments, we compare ROBP with 7 representative density-based clustering (or hierarchical clustering) algorithms, including BP, DBSCAN, HDBSCAN, density peak (DP) clustering, DPC-KNN, DPC-DBFN and McDPC, on 8 synthetic datasets and 11 real-world datasets. Results show that the proposed algorithm outperforms 7 competitors in most cases. Moreover, we compare the robustness of ROBP and BP, and evaluate their running time. Experimental results indicate that ROBP is much more robust and reliable, as well as it is competitive to BP in computational efficiency. (c) 2021 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available