4.7 Article

Clustering with Local Density Peaks-Based Minimum Spanning Tree

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2019.2930056

Keywords

minimum spanning tree; clustering; local density peaks; shared neighbors-based distance

Funding

  1. Project of Chongqing Education Commission [KJZH17104]
  2. National Natural Science Foundation of China [61502060, 61702060]

Ask authors/readers for more resources

The paper introduces a novel MST-based clustering algorithm LDP-MST, which utilizes local density peaks and a new distance measurement method to effectively discover clusters with complex structures. The experimental results demonstrate that the proposed algorithm is competitive with state-of-the-art methods in cluster discovery.
Clustering analysis has been widely used in statistics, machine learning, pattern recognition, image processing, and so on. It is a great challenge for most existing clustering algorithms to discover clusters with arbitrary shapes. Clustering algorithms based on Minimum spanning tree (MST) are able to discover clusters with arbitrary shapes, but they are time consuming and susceptible to noise points. In this paper, we employ local density peaks (LDP) to represent the whole data set and define a shared neighbors-based distance between local density peaks to better measure the dissimilarity between objects on manifold data. On the basis of local density peaks and the new distance, we propose a novel MST-based clustering algorithm called LDP-MST. It first uses local density peaks to construct MST and then repeatedly cuts the longest edge until a given number of clusters are found. The experimental results on synthetic data sets and real data sets show that our algorithm is competent with state-of-the-art methods when discovering clusters with complex structures.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available