4.7 Article

A fast density peaks clustering algorithm with sparse search

期刊

INFORMATION SCIENCES
卷 554, 期 -, 页码 61-83

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.11.050

关键词

DPC algorithm; Computational complexity; Sparse search strategy; Fewer distance calculations; Similarity matrix

资金

  1. National Natural Science Foundations of China [61672522, 61976216]

向作者/读者索取更多资源

A novel fast sparse search density peaks clustering (FSDPC) algorithm is proposed to enhance the efficiency of density peaks clustering (DPC) by using a sparse search strategy and a random third-party data point method, which outperforms the DPC and other state-of-the-art algorithms in terms of both computational complexity and clustering performance.
Given a large unlabeled set of complex data, how to efficiently and effectively group them into clusters remains a challenging problem. Density peaks clustering (DPC) algorithm is an emerging algorithm, which identifies cluster centers based on a decision graph. Without setting the number of cluster centers, DPC can effectively recognize the clusters. However, the similarity between every two data points must be calculated to construct a decision graph, which results in high computational complexity. To overcome this issue, we propose a fast sparse search density peaks clustering (FSDPC) algorithm to enhance the DPC, which constructs a decision graph with fewer similarity calculations to identify cluster centers quickly. In FSDPC, we design a novel sparse search strategy to measure the similarity between the nearest neighbors of each data points. Therefore, FSDPC can enhance the efficiency of the DPC while maintaining satisfactory results. We also propose a novel random third-party data point method to search the nearest neighbors, which introduces no additional parameters or high computational complexity. The experimental results on synthetic datasets and real-world datasets indicate that the proposed algorithm consistently outperforms the DPC and other state-of-the-art algorithms. (C) 2020 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据