4.7 Article

Non-parameter clustering algorithm based on saturated neighborhood graph

Journal

APPLIED SOFT COMPUTING
Volume 130, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2022.109647

Keywords

Clustering Data mining; Local density; Natural neighbor; Saturated neighborhood graph

Funding

  1. Fundamental Research Funds for the Central Universities, China [2020NQN41]
  2. National Natural Science Foundation of China [62171390]

Ask authors/readers for more resources

This paper introduces a non-parameter clustering algorithm based on saturated neighborhood graph (NPCSNG), which preprocesses the data set using mathematical methods and clusters the data using the characteristics of SNG adaptive clustering to achieve non-parameter clustering. The NPCSNG algorithm has the advantages of not requiring manual parameter setting, significantly improving clustering performance and model robustness, and adapting easily to data sets with complex manifold structure.
Clustering algorithms play a very important role in the field of data mining and machine learning. However existing clustering methods are sensitive to parameters and outliers. The commonly used clustering methods are restricted by problem of parameter selection that different algorithms need to input one or more different parameters. For overcoming these drawbacks, we propose a non-parameter clustering algorithm based on saturated neighborhood graph known as NPCSNG. NPCSNG algorithm uses mathematic method to preprocess the data set, and then uses the characteristics of SNG adaptive clustering to cluster the data, so as to achieve the purpose of non-parameter clustering. NPCSNG has three main advantages: (1) it does not need to manually set any parameters due to the use of adaptive saturated neighborhood graph; (2) it significantly improves the clustering performance as well as the model robustness, making NPCSNG a more practical approach for real-world scenarios; (3) it can easily adapt to data-sets with complex manifold structure. NPCSNG algorithm solves the problem of parameter selection of clustering algorithm and it broadens the idea of clustering by using the characteristics of graphs. (c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available