4.6 Article

An integrated K-means - Laplacian cluster ensemble approach for document datasets

期刊

NEUROCOMPUTING
卷 214, 期 -, 页码 495-507

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2016.06.034

关键词

Cluster analysis; Cluster ensemble; K-means; Laplacian

资金

  1. US National Institutes of Health (NIH) [U01 HL114494, NIHRO1HL089897]
  2. National Natural Science Foundation of China [61105057, 61375001]
  3. Natural Science Foundation of Jiangsu Province [BK20151299]
  4. Industry-Education-Research prospective project of Jiangsu Province of China [BY2014108-20, BY2015057-33]
  5. Jiangsu Province Qing Lan Project, Nature Science Foundation of the Jiangsu Higher Education Institutes of China [13KJB520024]
  6. Science and Technology Support Program of Jiangsu Province [BE2014679]
  7. Talent Introduction Project of Yancheng Institute of Technology [XKR2011019]

向作者/读者索取更多资源

Cluster ensemble has become an important extension to traditional clustering algorithms, yet the cluster ensemble problem is very challenging due to the inherent difficulty in resolving the label correspondence problem. We adapted the integrated K-means - Laplacian clustering approach to solve the cluster ensemble problem by exploiting both the attribute information embedded in the cluster labels and the pairwise relations among the objects. The optimal solution of the proposed approach requires computing the pseudo inverse of the normalized Laplacian matrix and the eigenvalue decomposition of a large matrix, which can be computationally burdensome for large scale document datasets. We devised an effective algebraic transformation method for efficiently carrying out the aforementioned computations and proposed an integrated K-means - Laplacian cluster ensemble approach (IKLCEA). Experimental results with benchmark document datasets demonstrate that IKLCEA outperforms other cluster ensemble techniques on most cases. In addition, IKLCEA is computationally efficient and can be readily employed in large scale document applications. (C) 2016 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据