4.5 Article

Density peaks clustering algorithm based on improved similarity and allocation strategy

Journal

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s13042-022-01711-7

Keywords

Density peaks clustering; Point variance; Manhattan distance; Non-central point allocation strategy

Ask authors/readers for more resources

The improved density peaks clustering algorithms (DPCV and MDDPC) based on the density peaks clustering algorithm have improved clustering accuracy by using methods such as variance and Manhattan distance, while solving problems such as the inability to effectively identify cluster centers and the chain reaction caused by error allocation.
Density peaks clustering (DPC) algorithm provides an efficient method to quickly find cluster centers with decision graph. In recent years, due to its unique parameter, no iteration, and good robustness, DPC has been widely studied and applied. However, it also has some shortcomings, such as unable to effectively identify cluster centers and the chain reaction caused by non-central points error allocation. Aiming at these two shortcomings of DPC, an improved density peaks clustering based on variance (DPCV) is proposed. First, the algorithm uses the variance between points to improve similarity and reduce the density difference of unevenly distributed data sets. Then, according to the similar density relationship between a cluster center and surrounding points, the low-density points are used as the dividing boundary of the initial allocation process. In order to optimize the time consumption of calculating the variance, this paper replaces the variance with the Manhattan distance between points and proposes density peaks clustering based on Manhattan distance (MDDPC). Theoretical analysis and experiments on artificial data and UCI data sets show that, compared with DPC and its improved algorithms, DPCV and MDDPC further improve the clustering accuracy of the DPC algorithm while controlling the running time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available