4.4 Article

Density peaks clustering based on k-nearest neighbors sharing

出版社

WILEY
DOI: 10.1002/cpe.5993

关键词

density peak clustering; k-nearest neighbors; local density; natural neighbors; shared neighbors

资金

  1. National Natural Science Foundation of China [61663029]
  2. Natural Science Foundation of Jiangxi Province [2018ACB21029, 20192BAB207031]
  3. Science Fund for Distinguished Young Scholars of Jiangxi Province [2018ACB21029]

向作者/读者索取更多资源

The DPC algorithm performs poorly on complex data sets with large differences in density, flow pattern or cross-winding, and has relatively poor fault tolerance in sample allocation. This article proposes the DPC-KNNS algorithm to improve clustering performance on datasets with large density differences and complex patterns.
The density peaks clustering (DPC) algorithm is a density-based clustering algorithm. Its density peak depends on the density-distance model to determine it. The definition of local density for samples used in DPC algorithm only considers distance between samples, while the environments of samples are neglected. This leads to the result that DPC algorithm performs poorly on complex data sets with large difference in density, flow pattern or cross-winding. In the meantime, the fault tolerance of allocation strategy for samples is relatively poor. Based on the findings, this article proposes a density peaks clustering based on k-nearest neighbors sharing (DPC-KNNS) algorithm, which uses the similarity between shared neighbors and natural neighbors to define the local density of samples and the allocation. Comparison between theoretical analysis and experiments on various synthetic and real data reveal that the algorithm proposed in this article can discover the cluster center of complex data sets with large difference in density, flow pattern or cross-winding. It can also provide effective clustering.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据