4.6 Article

Spectral clustering with eigenvector selection based on entropy ranking

期刊

NEUROCOMPUTING
卷 73, 期 10-12, 页码 1704-1717

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2009.12.029

关键词

Pattern recognition; Spectral clustering; Eigenvector selection; Entropy ranking

资金

  1. National Natural Science Foundation of China [60703108, 60702062, 60970067]
  2. National High Technology Research and Development Program (863 Program) of China [2009AA12Z210]
  3. National Basic Research Program (973 Program) of China [2006CB705707]
  4. Program for Cheung Kong Scholars and Innovative Research Team in University [IRT0645]

向作者/读者索取更多资源

Ng-Jordan-Weiss (NJW) method is one of the most widely used spectral clustering algorithms. For a K clustering problem, this method partitions data using the largest K eigenvectors of the normalized affinity matrix derived from the dataset. It has been demonstrated that the spectral relaxation solution of K-way grouping is located on the subspace of the largest K eigenvectors. However, we find from a lot of experiments that the top K eigenvectors cannot always detect the structure of the data for real pattern recognition problems. So it is necessary to select eigenvectors for spectral clustering. We propose an eigenvector selection method based on entropy ranking for spectral clustering (ESBER). In this method, first all the eigenvectors are ranked according to their importance on clustering, and then a suitable eigenvector combination is obtained from the ranking list. In this paper, we propose two strategies to select eigenvectors in the ranking list of eigenvectors. One is directly adopting the first K eigenvectors in the ranking list. Different to the largest K eigenvectors of NJW method, these K eigenvectors are the most important eigenvectors among all the eigenvectors. The other eigenvector selection strategy is to search a suitable eigenvector combination among the first Km (Km>K) eigenvectors in the ranking list. The eigenvector combination obtained by this strategy can reflect the structure of the original data and lead to a satisfying spectral clustering result. Furthermore, we also present computational complexity reduction strategies for ESBER method to deal with large-scale datasets. We have performed experiments on UCI benchmark datasets, MNIST handwritten digits datasets, and Brodatz texture datasets, adopting NJW method for a baseline comparison. The experimental results show that ESBER method is more robust than NJW method. Especially. ESBER method with the latter eigenvector selection strategy can obtain satisfying clustering results in most cases. (C) 2010 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据