Journal
IEEE TRANSACTIONS ON BIG DATA
Volume 8, Issue 1, Pages 193-202Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TBDATA.2019.2931532
Keywords
Spectral clustering; compressive spectral embedding; graph signal filter; random sampling; interpolation
Funding
- National Natural Science Foundation of China [61772541, 61872376, 61572324]
- Joint Key Project of NSFC [U1736207]
- US National Science Foundation [NSF 1547102, 1564097]
- IBM faculty award
Ask authors/readers for more resources
Spectral clustering (SC) is widely used in industrial product analysis as an unsupervised learning method. Compressive spectral clustering (CSC) accelerates clustering effectively but suffers from expensive computation and time-consuming interpolation. The proposed fast compressive spectral clustering (FCSC) method addresses these issues by assuming eigenvalues satisfy local uniform distribution and reconstructing denoised Laplacian matrix with low-dimensional representation. The experimental results show that FCSC significantly reduces computation time while maintaining high clustering accuracy.
Spectral clustering (SC) is an unsupervised learning method that has been widely used in industrial product analysis. Compressive spectral clustering (CSC) effectively accelerates clustering by leveraging graph filter and random sampling techniques. However, CSC suffers from two major problems. First, the direct use of the dichotomy and eigencount techniques for estimating Laplacian matrix's kth eigenvalue is expensive. Second, the computation of interpolation is time-consuming because it requires to repeat matrix-vector product for every cluster in each iteration. To address these problems, we propose a new method called fast compressive spectral clustering (FCSC). Our method addresses the first problem by assuming that the eigenvalues approximately satisfy local uniform distribution, and addresses the second problem by recalculating the pairwise similarity between nodes with low-dimensional representation to reconstruct denoised laplacian matrix. The time complexity of reconstruction is linear with the number of non-zeros in Laplacian matrix. As experimentally demonstrated on both artificial and real-world datasets, our method significantly reduces the computation time while preserving high clustering accuracy comparable to previous designs, demonstrating the effectiveness of FCSC.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available