4.7 Article

Online Tensor Low-Rank Representation for Streaming Data Clustering

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCSVT.2022.3207484

Keywords

Tensors; Optimization; Clustering algorithms; Matrix decomposition; Memory management; Iterative methods; Costs; Low-rank tensor recovery; missing data; online learning; tensor data clustering; tensor low-rank representation

Ask authors/readers for more resources

Tensor low-rank representation (TLRR) has been widely studied for capturing the intrinsic low-rank structure of tensor data. However, TLRR suffers from high computational complexity for large-scale data. In this paper, an online low-rank tensor subspace clustering algorithm (OLRTSC) is proposed to address this issue by leveraging the stochastic optimization technique. OLRTSC is online, can handle dynamic data, avoids computing t-SVD, and reduces storage cost significantly.
Tensor low-rank representation (TLRR) has attracted increasing attention in recent years due to its effectiveness in capturing the intrinsic low-rank structure of tensor data. However, it is known that TLRR suffers from high computational complexity for large-scale data because it minimizes the tensor rank of the representation tensor whose size is proportional to the sample size. This paper develops a streaming algorithm-termed online low-rank tensor subspace clustering (OLRTSC)-for low-rank tensor data recovery and clustering based on the tensor singular value decomposition (t-SVD) algebraic framework. The key idea of OLRTSC is the non-convex reformulation of the objective function of TLRR by decomposing the tensor nuclear norm into an explicit tensor product of two low-rank tensors, which can be solved by leveraging the stochastic optimization technique. Compared to batch TLRR, OLRTSC is online by nature, can handle dynamic data, avoids computing t-SVD of the representation tensor, and reduces the storage cost significantly. This paper provides the theoretical guarantee that the sequence of the solutions produced by OLRTSC converges almost surely to a stationary point of the objective function. Moreover, an extension of OLRTSC is also proposed to handle the case when parts of the data are missing. Finally, experimental results on both synthetic and real data demonstrate the efficiency and robustness of the proposed approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available