4.7 Article

A multi-center clustering algorithm based on mutual nearest neighbors for arbitrarily distributed data

Journal

INTEGRATED COMPUTER-AIDED ENGINEERING
Volume 29, Issue 3, Pages 259-275

Publisher

IOS PRESS
DOI: 10.3233/ICA-220682

Keywords

Multiple centers; data clustering; mutual nearest neighbors; arbitrary distribution

Funding

  1. NSFC [61872281]
  2. NKRDPC [2017YFC1703506]

Ask authors/readers for more resources

This paper presents a multi-center clustering algorithm based on mutual nearest neighbors, which can effectively cluster non-convex data sets without any parameters by adaptively finding center points and utilizing the role of multiple center points, and obtaining final clusters based on the degree of overlapping and distance.
Multi-center clustering algorithms have attracted the attention of researchers because they can deal with complex data sets more effectively. However, the reasonable determination of cluster centers and their number as well as the final clusters is a challenging problem. In order to solve this problem, we propose a multi-center clustering algorithm based on mutual nearest neighbors (briefly MC-MNN). Firstly, we design a center-point discovery algorithm based on mutual nearest neighbors, which can adaptively find center points without any parameters for data sets with different density distributions. Then, a sub-cluster discovery algorithm is designed based on the connection of center points. This algorithm can effectively utilize the role of multiple center points, and can effectively cluster non-convex data sets. Finally, we design a merging algorithm, which can effectively obtain final clusters based on the degree of overlapping and distance between sub-clusters. Compared with existing algorithms, the MC-MNN has four advantages: (1) It can automatically obtain center points by using the mutual nearest neighbors; (2) It runs without any parameters; (3) It can adaptively find the final number of clusters; (4) It can effectively cluster arbitrarily distributed data sets. Experiments show the effectiveness of the MC-MNN and its superiority is verified by comparing with five related algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available