4.6 Article

Elastic Differential Evolution for Automatic Data Clustering

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 51, Issue 8, Pages 4134-4147

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2019.2941707

Keywords

Clustering algorithms; Optimization; Partitioning algorithms; Encoding; Sociology; Statistics; Approximation algorithms; Clustering; differential evolution; elastic encoding; subspace

Funding

  1. National Natural Science Foundation of China [61873095, 61873097, U1701267]
  2. Guangzhou Science and Technology Planning Project [201904010211]
  3. Guangdong Natural Science Foundation Research Team Project [2018B030312003]
  4. Guangdong-Hong Kong Joint Innovation Platform Project [2018B050502006]

Ask authors/readers for more resources

This article introduces a novel elastic differential evolution algorithm for automatic data clustering, which adapts the number of clusters and centroids through variable-length encoding and evolution operators, eliminating encoding redundancy. The algorithm considers each clustering layout as a whole and includes subspace crossover and two-phase mutation operators to properly exchange information among individuals of different lengths.
In many practical applications, it is crucial to perform automatic data clustering without knowing the number of clusters in advance. The evolutionary computation paradigm is good at dealing with this task, but the existing algorithms encounter several deficiencies, such as the encoding redundancy and the cross-dimension learning error. In this article, we propose a novel elastic differential evolution algorithm to solve automatic data clustering. Unlike traditional methods, the proposed algorithm considers each clustering layout as a whole and adapts the cluster number and cluster centroids inherently through the variable-length encoding and the evolution operators. The encoding scheme contains no redundancy. To enable the individuals of different lengths to exchange information properly, we develop a subspace crossover and a two-phase mutation operator. The operators employ the basic method of differential evolution and, in addition, they consider the spatial information of cluster layouts to generate offspring solutions. Particularly, each dimension of the parameter vector interacts with its correlated dimensions, which not only adapts the cluster number but also avoids the cross-dimension learning error. The experimental results show that our algorithm outperforms the state-of-the-art algorithms that it is able to identify the correct number of clusters and obtain a good cluster validation value.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available