4.7 Article

Truncated tensor Schatten p-norm based approach for spatiotemporal traffic data imputation with complicated missing patterns

Journal

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2022.103737

Keywords

Spatiotemporal traffic data; Missing data imputation; Low-rank tensor completion; Truncated tensor Schatten p-norm; Fiber-like missing; Generalized soft-thresholding

Funding

  1. National Natural Science Foundation of China [52125208]
  2. Shanghai Municipal Science and Technology, China Major Project [2021SHZDZX0100]

Ask authors/readers for more resources

This paper explores an innovative method for imputing missing spatiotemporal traffic data using tensors. The proposed method utilizes a nonconvex truncated Schatten p-norm (TSpN) to approximate tensor rank and combines the alternating direction method of multipliers (ADMM) with generalized soft-thresholding (GST) to derive global optimal solutions. Experimental results show that the proposed method performs well under various missing cases, even with high rates of data loss, outperforming other state-of-the-art tensor-based imputation models in almost all scenarios.
Rapid advances in sensor, wireless communication, cloud computing and data science have brought unprecedented amount of data to assist transportation engineers and researchers in making better decisions. However, traffic data in reality often has corrupted or incomplete values due to detector and communication malfunctions. Data imputation is thus required to ensure the effectiveness of downstream data-driven applications. To this end, numerous tensor based methods treating the imputation problem as the low-rank tensor completion (LRTC) have been attempted in previous works. To tackle rank minimization, which is at the core of the LRTC, most of aforementioned methods utilize the tensor nuclear norm (NN) as a convex surrogate for the minimization. However, the over-relaxation issue in NN refrains it from desirable performance in practice. In this paper, we define an innovative nonconvex truncated Schatten p-norm for tensors (TSpN) to approximate tensor rank and impute missing spatiotemporal traffic data under the LRTC framework. We model traffic data into a third-order tensor structure of time intervalsxlocations (sensors)xdays and introduce four complicated missing patterns, including random missing and three fiber-like missing cases according to the tensor mode -n fibers. Despite nonconvexity of the objective function in our model, we derive the global optimal solutions by integrating the alternating direction method of multipliers (ADMM) with generalized soft-thresholding (GST). In addition, we design a truncation rate decay strategy to deal with varying missing rate scenarios. Comprehensive experiments are finally conducted using four different types of real-world spatiotemporal traffic datasets, which demonstrate that the proposed LRTC-TSpN method performs well under various missing cases even with high rates of data loss, meanwhile outperforming other state-of-the-art tensor-based imputation models in almost all scenarios. In general, the performance of the proposed method achieves up to a 52% improvement in mean absolute error (MAE) compared to baseline methods even when the missing ratio is as high as 90%. Moreover, both theoretical and empirical evidences indicate that our method has robust and efficient convergence performances.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available