4.7 Article

Fast Compressive Spectral Clustering for Large-Scale Sparse Graph

Journal

IEEE TRANSACTIONS ON BIG DATA
Volume 8, Issue 1, Pages 193-202

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TBDATA.2019.2931532

Keywords

Spectral clustering; compressive spectral embedding; graph signal filter; random sampling; interpolation

Funding

  1. National Natural Science Foundation of China [61772541, 61872376, 61572324]
  2. Joint Key Project of NSFC [U1736207]
  3. US National Science Foundation [NSF 1547102, 1564097]
  4. IBM faculty award

Ask authors/readers for more resources

Spectral clustering (SC) is widely used in industrial product analysis as an unsupervised learning method. Compressive spectral clustering (CSC) accelerates clustering effectively but suffers from expensive computation and time-consuming interpolation. The proposed fast compressive spectral clustering (FCSC) method addresses these issues by assuming eigenvalues satisfy local uniform distribution and reconstructing denoised Laplacian matrix with low-dimensional representation. The experimental results show that FCSC significantly reduces computation time while maintaining high clustering accuracy.
Spectral clustering (SC) is an unsupervised learning method that has been widely used in industrial product analysis. Compressive spectral clustering (CSC) effectively accelerates clustering by leveraging graph filter and random sampling techniques. However, CSC suffers from two major problems. First, the direct use of the dichotomy and eigencount techniques for estimating Laplacian matrix's kth eigenvalue is expensive. Second, the computation of interpolation is time-consuming because it requires to repeat matrix-vector product for every cluster in each iteration. To address these problems, we propose a new method called fast compressive spectral clustering (FCSC). Our method addresses the first problem by assuming that the eigenvalues approximately satisfy local uniform distribution, and addresses the second problem by recalculating the pairwise similarity between nodes with low-dimensional representation to reconstruct denoised laplacian matrix. The time complexity of reconstruction is linear with the number of non-zeros in Laplacian matrix. As experimentally demonstrated on both artificial and real-world datasets, our method significantly reduces the computation time while preserving high clustering accuracy comparable to previous designs, demonstrating the effectiveness of FCSC.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available