☆ 4.7 Article

Study on density peaks clustering based on k-nearest neighbors and principal component analysis

KNOWLEDGE-BASED SYSTEMS (2016)

期刊

KNOWLEDGE-BASED SYSTEMS

卷 99, 期 -, 页码 135-145

出版社

ELSEVIER SCIENCE BV

DOI: 10.1016/j.knosys.2016.02.001

关键词

Data clustering; Density peaks; k Nearest neighbors (KNN); Principal component analysis (PCA)

类别

Computer Science, Artificial Intelligence

资金

National Natural Science Foundation of China [61379101]
National Key Basic Research Program of China [2013CB329502]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Density peaks clustering (DPC) algorithm published in the US journal Science in 2014 is a novel clustering algorithm based on density. It needs neither iterative process nor more parameters. However, original algorithm only has taken into account the global structure of data, which leads to missing many clusters. In addition, DPC does not perform well when data sets have relatively high dimension. Especially, DPC generates wrong number of clusters of real-world data sets. In order to overcome the first problem, we propose a density peaks clustering based on k nearest neighbors (DPC-KNN) which introduces the idea of k nearest neighbors (KNN) into DPC and has another option for the local density computation. In order to overcome the second problem, we introduce principal component analysis (PCA) into the model of DPC-KNN and further bring forward a method based on PCA (DPC-KNN-PCA), which preprocesses high dimensional data. By experiments on synthetic data sets, we demonstrate the feasibility of our algorithms. By experiments on real-world data sets, we compared this algorithm with k-means algorithm and spectral clustering (SC) algorithm in accuracy. Experimental results show that our algorithms are feasible and effective. (C) 2016 Elsevier B.V. All rights reserved.

Study on density peaks clustering based on k-nearest neighbors and principal component analysis

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER SCIENCE BV

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Study on density peaks clustering based on k-nearest neighbors and principal component analysis

期刊

KNOWLEDGE-BASED SYSTEMS

出版社

ELSEVIER SCIENCE BV

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文