4.5 Article

Estimating the Optimal Number of Clusters Via Internal Validity Index

Journal

NEURAL PROCESSING LETTERS
Volume 53, Issue 2, Pages 1013-1034

Publisher

SPRINGER
DOI: 10.1007/s11063-021-10427-8

Keywords

Clustering validity index; Number of clusters; Affinity propagation; Hierarchical clustering

Funding

  1. Fundamental Research Funds for the Central Universities [JUSRP11235]
  2. National Natural Science Foundation of China [61673193, 61833007]

Ask authors/readers for more resources

Estimating the optimal number of clusters (NC) is crucial in cluster analysis. A novel internal clustering validity index BWC is proposed in this paper to improve the Silhouette index, validating clustering results and estimating the optimal NC for various types of data sets. Theoretical analysis and experimental studies demonstrate the effectiveness and efficiency of the new index and method.
Estimating the optimal number of clusters (NC) is pivotal in cluster analysis. From the viewpoint of sample geometry, a novel internal clustering validity index, which is termed the between-within cluster (BWC) index, is designed in this paper. Moreover, a method is proposed to estimate the optimal NC. The BWC index improves the well-known Silhouette index. BWC validates the clustering results from a certain clustering algorithm (e.g., affinity propagation or hierarchical) and estimates the optimal NC for many kinds of data sets, including synthetic data sets, benchmark data sets, UCI data sets, gene expression data sets, and images. Theoretical analysis and experimental studies demonstrate the effectiveness and high efficiency of the new index and method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available