4.6 Article

Soft Semi-Supervised Deep Learning-Based Clustering

Journal

APPLIED SCIENCES-BASEL
Volume 13, Issue 17, Pages -

Publisher

MDPI
DOI: 10.3390/app13179673

Keywords

deep clustering; semi-supervised clustering; soft constraints; fuzzy clustering

Ask authors/readers for more resources

In this paper, a novel semi-supervised deep clustering approach named SC-DEC is proposed to address the limitations exhibited by existing semi-supervised clustering approaches. The proposed approach leverages a deep neural network architecture to generate fuzzy membership degrees that better reflect the true partition of the data. Experimental results show that utilizing minimal previous knowledge about the data can improve the overall clustering performance.
Semi-supervised clustering typically relies on both labeled and unlabeled data to guide the learning process towards the optimal data partition and to prevent falling into local minima. However, researchers' efforts made to improve existing semi-supervised clustering approaches are relatively scarce compared to the contributions made to enhance the state-of-the-art fully unsupervised clustering approaches. In this paper, we propose a novel semi-supervised deep clustering approach, named Soft Constrained Deep Clustering (SC-DEC), that aims to address the limitations exhibited by existing semi-supervised clustering approaches. Specifically, the proposed approach leverages a deep neural network architecture and generates fuzzy membership degrees that better reflect the true partition of the data. In particular, the proposed approach uses side-information and formulates it as a set of soft pairwise constraints to supervise the machine learning process. This supervision information is expressed using rather relaxed constraints named should-link constraints. Such constraints determine whether the pairs of data instances should be assigned to the same or different cluster(s). In fact, the clustering task was formulated as an optimization problem via the minimization of a novel objective function. Moreover, the proposed approach's performance was assessed via extensive experiments using benchmark datasets. Furthermore, the proposed approach was compared to relevant state-of-the-art clustering algorithms, and the obtained results demonstrate the impact of using minimal previous knowledge about the data in improving the overall clustering performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available