4.7 Article

A novel method for optimizing spectral rotation embedding K-means with coordinate descent

期刊

INFORMATION SCIENCES
卷 612, 期 -, 页码 1095-1110

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.09.011

关键词

Clustering; K-means; Spectral clustering; Spectral rotation; Lloyd?s heuristic; Coordinate descent

资金

  1. National Key Research and Development Program [2020YFB1713700]
  2. Program of Natural Science Foundation of China [61963015, 61733005]

向作者/读者索取更多资源

In this study, we propose a novel optimization method for the K-Means model that improves the clustering performance by introducing an improved spectral rotation. The experiments demonstrate the effectiveness of the proposed algorithm.
Lloyd's heuristic K-Means, one of the most widely used clustering methods, plays a vital role in a variety of downstream tasks in machine learning owing to its simplicity. However, Lloyd's heuristic sometimes performs poorly in finding the local minimum and is heavily influenced by the initial points. To address these issues, we propose a novel opti-mization method for the K-Means model. First, we establish that the K-Means minimiza-tion problem can be reformulated as a trace maximization problem, which can be seen as a unified view of spectral clustering. Then, we relax the constraint of the scaled cluster matrix and implement an improved spectral rotation to bring the cluster matrix infinitely close to the binary indicator matrix. To this end, an efficient and redundancy-free coordi-nate descent (CD) method is used to optimize the spectral rotation. Extensive experiments including a hybrid test on several different datasets showed that the proposed algorithm achieved better local objective values compared to Lloyd's heuristic under different initial-ization strategies (random or K-Means++). In the hybrid test, the proposed algorithm could further decrease the convergence value of the objective function obtained by Lloyd's heuristic; conversely, Lloyd's heuristic did not work. Moreover, statistical hypothesis and comparison tests further validated the superiority of the proposed algorithm.(c) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据