4.7 Article

Robust deep k-means: An effective and simple method for data clustering

Journal

PATTERN RECOGNITION
Volume 117, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2021.107996

Keywords

k-means algorithm; Robust clustering; Deep learning

Funding

  1. State Key Program of the National Science Foundation of China [61836006]
  2. National Natural Science Fund for Distinguished Young Scholar [61625204]
  3. Sichuan Science and Technology Program [2020YFG0323]
  4. Fundamental Research Funds for the Central Universities [1082204112364]


Clustering aims to divide input data into different groups based on distance or similarity, with k-means being a widely used method. A deep k-means model is proposed in this study to improve clustering performance by extracting deep representations using deep learning techniques.
Clustering aims to partition an input dataset into distinct groups according to some distance or similarity measurements. One of the most widely used clustering methods nowadays is the k-means algorithm because of its simplicity and efficiency. In the last few decades, k-means and its various extensions have been formulated to solve practical clustering problems. However, existing clustering methods are often presented in a single-layer formulation (i.e., shallow formulation). As a result, the mapping between the obtained low-level representation and the original input data may contain rather complex hierarchical information. To overcome the drawbacks of low-level features, deep learning techniques are adopted to extract deep representations and improve the clustering performance. In this paper, we propose a robust deep k-means model to learn the hidden representations associated with different implicit lower-level attributes. By using the deep structure to hierarchically perform k-means, the hierarchical semantics of data can be exploited in a layerwise way. Data samples from the same class are forced to be closer layer by layer, which is beneficial for the clustering task. The objective function of our model is reformulated into a more tractable form such that the optimization problem can be tackled more easily and the final robust results can be obtained. Experimental results over 12 benchmark data sets substantiate that the proposed model achieves a breakthrough in clustering performance, compared with both classical and state-of-the-art methods. (c) 2021 Elsevier Ltd. All rights reserved.
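For orientation, the single-layer (shallow) k-means that the abstract contrasts against can be sketched as follows. This is a minimal illustration of the classical Lloyd iteration in NumPy, not the robust deep k-means model proposed in the paper; the function name `kmeans`, its parameters, and the toy data are assumptions introduced here for illustration only.

```python
import numpy as np


def kmeans(X, k, n_iter=100, seed=0):
    """Classical (single-layer) k-means via Lloyd's iteration.

    Hypothetical helper for illustration; returns (centroids, labels).
    """
    rng = np.random.default_rng(seed)
    # Initialise centroids with k distinct samples drawn at random.
    centroids = X[rng.choice(len(X), size=k, replace=False)]

    def assign(c):
        # Squared Euclidean distance from every sample to every centroid.
        dists = ((X[:, None, :] - c[None, :, :]) ** 2).sum(axis=2)
        return dists.argmin(axis=1)

    for _ in range(n_iter):
        labels = assign(centroids)
        # Update step: mean of each cluster; keep the old centroid if empty.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids

    # Final assignment so labels always match the returned centroids.
    return centroids, assign(centroids)


# Toy data (assumed): two well-separated 2-D blobs of 20 points each.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.1, (20, 2)), rng.normal(5.0, 0.1, (20, 2))])
centroids, labels = kmeans(X, k=2)
```

Each sample here is assigned directly from its raw features in one step. According to the abstract, the proposed model instead applies k-means hierarchically over several layers, so that samples from the same class are pulled closer layer by layer.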
