4.7 Article

Spectral Ensemble Clustering via Weighted K-Means: Theoretical and Practical Evidence

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume 29, Issue 5, Pages 1129-1143

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2017.2650229

Keywords

Consensus clustering; ensemble clustering; spectral clustering; co-association matrix; weighted K-means

Funding

  1. NSFC [71531001, 71322104, 71471009, U1636210, 71490723, 71171007]
  2. National High Technology Research and Development Program of China [SS2014AA012303]
  3. National Center for International Joint Research on E-Business Information Processing [2013B01035]
  4. Fundamental Research Funds for the Central Universities
  5. Australian Research Council [FT-130101457, DP-140102164, LP-150100671]
  6. US National Science Foundation CNS award [1314484]
  7. US National Science Foundation [1651902]
  8. Direct For Computer & Info Scie & Enginr
  9. Division Of Computer and Network Systems [1314484] Funding Source: National Science Foundation

Ask authors/readers for more resources

As a promising way for heterogeneous data analytics, consensus clustering has attracted increasing attention in recent decades. Among various excellent solutions, the co-association matrix based methods form a landmark, which redefines consensus clustering as a graph partition problem. Nevertheless, the relatively high time and space complexities preclude it from wide real-life applications. We, therefore, propose Spectral Ensemble Clustering (SEC) to leverage the advantages of co-association matrix in information integration but run more efficiently. We disclose the theoretical equivalence between SEC and weighted K-means clustering, which dramatically reduces the algorithmic complexity. We also derive the latent consensus function of SEC, which to our best knowledge is the first to bridge co-association matrix based methods to the methods with explicit global objective functions. Further, we prove in theory that SEC holds the robustness, generalizability, and convergence properties. We finally extend SEC to meet the challenge arising from incomplete basic partitions, based on which a row-segmentation scheme for big data clustering is proposed. Experiments on various real-world data sets in both ensemble and multi-view clustering scenarios demonstrate the superiority of SEC to some state-of-the-art methods. In particular, SEC seems to be a promising candidate for big data clustering.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available