4.6 Article

Towards information-theoretic K-means clustering for image indexing

期刊

SIGNAL PROCESSING
卷 93, 期 7, 页码 2026-2037

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.sigpro.2012.07.030

关键词

Information-theoretic clustering; K-means; KL-divergence; Variable Neighborhood Search (VNS)

资金

  1. National Natural Science Foundation of China [71072172, 61103229, 70901002, 71171007, 71031001, 70890080, 90924020]
  2. Jiangsu Provincial Colleges and Universities Outstanding S&T Innovation Team Fund [2001013]
  3. Key Project of Natural Science Research in Jiangsu Provincial Colleges and Universities [12KJA520001]

向作者/读者索取更多资源

Information-theoretic K-means (Info-Kmeans) aims to cluster high-dimensional data, such as images featured by the bag-of-features (BOF) model, using K-means algorithm with KL-divergence as the distance. While research efforts along this line have shown promising results, a remaining challenge is to deal with the high sparsity of image data. Indeed, the centroids may contain many zero-value features that create a dilemma in assigning objects to centroids during the iterative process of Info-Kmeans. To meet this challenge, we propose a Summation-bAsed Incremental Learning (SAIL) algorithm for Info-Kmeans clustering in this paper. Specifically, SAIL can avoid the zero-feature dilemma by replacing the computation of KL-divergence between instances and centroids, by the computation of centroid entropies only. To further improve the clustering quality, we also introduce the Variable Neighborhood Search (VNS) meta-heuristic and propose the V-SAIL algorithm. Experimental results on various benchmark data sets clearly demonstrate the effectiveness of SAIL and V-SAIL. In particular, they help to successfully recognize nine out of 11 landmarks from extremely high-dimensional and sparse image vectors, with the presence of severe noise. (C) 2012 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据