4.7 Article

Self-Expressive Kernel Subspace Clustering Algorithm for Categorical Data with Embedded Feature Selection

Journal

MATHEMATICS
Volume 9, Issue 14, Pages -

Publisher

MDPI
DOI: 10.3390/math9141680

Keywords

machine learning; categorical data; similarity; feature selection; kernel density estimation; non-linear optimization; kernel clustering

Categories

Funding

  1. Key-Area Research and Development Program of Guangdong Province [2019B010137002]
  2. National Natural Science Foundation of China [U1805263, 61672157]

Ask authors/readers for more resources

The study introduces a novel self-expressive kernel subspace clustering algorithm, which utilizes self-expressive kernel density estimation and a new feature-weighted non-linear similarity measurement. This algorithm employs an effective non-linear optimization method to solve the clustering objective function, achieving better effectiveness and efficiency in exploring non-linear relationships among attributes.
Kernel clustering of categorical data is a useful tool to process the separable datasets and has been employed in many disciplines. Despite recent efforts, existing methods for kernel clustering remain a significant challenge due to the assumption of feature independence and equal weights. In this study, we propose a self-expressive kernel subspace clustering algorithm for categorical data (SKSCC) using the self-expressive kernel density estimation (SKDE) scheme, as well as a new feature-weighted non-linear similarity measurement. In the SKSCC algorithm, we propose an effective non-linear optimization method to solve the clustering algorithm's objective function, which not only considers the relationship between attributes in a non-linear space but also assigns a weight to each attribute in the algorithm to measure the degree of correlation. A series of experiments on some widely used synthetic and real-world datasets demonstrated the better effectiveness and efficiency of the proposed algorithm compared with other state-of-the-art methods, in terms of non-linear relationship exploration among attributes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available