4.7 Article

ROBP a robust border-peeling clustering using Cauchy kernel

期刊

INFORMATION SCIENCES
卷 571, 期 -, 页码 375-400

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.04.089

关键词

Density-based clustering; Border-peeling clustering; Cauchy kernel

资金

  1. National Natural Science Foundation of China [62006104, 61872168]
  2. Nat-ural Science Foundation of the Jiangsu Higher Education Institutions [20KJB520012]
  3. Scientific Research Founda-tion for Young Teachers by Jiangsu Normal University [18XLRX006]

向作者/读者索取更多资源

The border-peeling (BP) clustering algorithm is effective in recognizing cluster structures and detecting outliers, but may perform poorly on datasets with non-uniformly distributed clusters and complex shapes. To address these issues, a robust border-peeling clustering algorithm (ROBP) is proposed in this paper, which outperforms competitors in most cases. Experiment results demonstrate that ROBP is more robust, reliable, and computationally efficient compared to BP.
Recently a novel density-based clustering algorithm, namely, border-peeling (BP) clustering algorithm, is proposed to group data by iteratively identifying border points and peeling off them until separable areas of data remain. The BP clustering is able to correctly recognize the true structure of clusters and automatically detect the outliers on several test cases. However, there are some drawbacks in BP, and these may hinder its widespread application. The BP clustering might yield bad results on datasets with non-uniformly distributed clusters. Especially, the BP clustering tends to over-partition the data with complex shape. To overcome these defects, a robust border-peeling clustering algorithm (named as ROBP) is proposed in this paper. Our method improves the BP clustering algorithm from two aspects: density influence (i.e. density estimation) and linkage criterion (i.e. association strategy). In density estimation, we use Cauchy kernel with longer tails instead of Gaussian kernel in the local scaling function, and further propose a kernel density estimator, i.e., the density estimator based on Cauchy kernel. It can calculate quickly and accurately the density influence value of each point. In association strategy, we design a linkage criterion based on the shared neighborhood information. The linkage criterion can create some links between peeled border points and their neighboring peeled border points, in order to avoid over-segmentation of the clusters. We integrate the proposed linkage criterion and the uni-directional association strategy, and further propose a bidirectional association strategy. In experiments, we compare ROBP with 7 representative density-based clustering (or hierarchical clustering) algorithms, including BP, DBSCAN, HDBSCAN, density peak (DP) clustering, DPC-KNN, DPC-DBFN and McDPC, on 8 synthetic datasets and 11 real-world datasets. Results show that the proposed algorithm outperforms 7 competitors in most cases. Moreover, we compare the robustness of ROBP and BP, and evaluate their running time. Experimental results indicate that ROBP is much more robust and reliable, as well as it is competitive to BP in computational efficiency. (c) 2021 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据