Article

A dual-stage attention-based Conv-LSTM network for spatio-temporal correlation and multivariate time series prediction

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 36, Issue 5, Pages 2036-2057

Publisher

WILEY-HINDAWI
DOI: 10.1002/int.22370

Keywords

attention mechanism; long-short term memory; multivariate time series; prediction; spatio-temporal correlation

Funding

  1. Fundamental Research Funds for the Central Universities [2019BSCX14]
  2. National Key R&D Program of China [2020YFF0304901]

The study introduces a convolutional LSTM network with two-stage attention for multivariate time series (MTS) prediction, addressing the problem of insufficient temporal dependency in MTS prediction. The model improves prediction accuracy by extracting the spatio-temporal correlations of MTS and applying an attention mechanism, and it shows promising application prospects.

Multivariate time series (MTS) prediction aims to predict future time series by extracting multiple forms of dependency from past time series. Traditional and deep learning-based prediction methods focus on extracting the dynamic relationships of certain aspects of MTS, especially the temporal characteristics, and often neglect the dynamic spatial and temporal correlations of MTS. Inspired by the convolutional neural network (CNN) and the attention mechanism, this paper proposes a convolutional LSTM network model with two-stage attention for MTS prediction. Specifically, we first propose a new MTS preprocessing method so that convolution operations can be performed more effectively. A convolutional layer then extracts the spatial correlation of the MTS, and an LSTM model extracts the temporal correlation. Combining the attention mechanism with the LSTM effectively addresses the problem of insufficient time dependency in MTS prediction. In addition, the dual-stage attention mechanism eliminates irrelevant information: it selects the relevant exogenous sequences and gives them higher weight, and it adds the past values of the target sequence to filter out irrelevant information further. Finally, the spatio-temporal correlation of the MTS is extracted to improve prediction accuracy, and the model is interpreted. Experiments on typical datasets from finance, environment, and energy determine the optimal window size and hidden size for prediction and demonstrate that the model achieves state-of-the-art results compared with four other deep learning models. These results suggest that the model has broad application prospects. Moreover, the model is suitable not only for single-step prediction of MTS but also for multistep prediction over a certain range of time steps.
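
The abstract mentions a window size, single-step and multistep targets, and an attention stage that gives relevant exogenous sequences higher weight. The NumPy sketch below illustrates those three ideas only, under stated assumptions: it is not the paper's preprocessing method or network. The function names `make_windows` and `input_attention` are hypothetical, the target series is assumed to be the last column, and the attention scores are hand-set placeholders, whereas a real model would learn them.

```python
import numpy as np

def make_windows(series, window, horizon=1):
    """Slice a (T, n_features) array into supervised samples.

    Returns X with shape (samples, window, n_features) and y with shape
    (samples, horizon). Assumption: the target series is the last column.
    """
    series = np.asarray(series, dtype=float)
    n = series.shape[0] - window - horizon + 1
    if n <= 0:
        raise ValueError("series too short for this window and horizon")
    X = np.stack([series[i:i + window] for i in range(n)])
    y = np.stack([series[i + window:i + window + horizon, -1] for i in range(n)])
    return X, y

def softmax(scores, axis=-1):
    """Numerically stable softmax."""
    shifted = scores - scores.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)

def input_attention(X, scores):
    """Reweight each feature column by softmax(scores).

    Assumption: `scores` is a placeholder vector with one entry per feature;
    in an attention-based model these scores come from trained parameters.
    """
    weights = softmax(np.asarray(scores, dtype=float))
    return X * weights, weights

# Toy data: 10 time steps, 2 series (row i is [2i, 2i + 1]).
series = np.arange(20).reshape(10, 2)
X, y = make_windows(series, window=3, horizon=2)
X_weighted, weights = input_attention(X, scores=[0.0, 1.0])
```

With `horizon=1` the same helper produces single-step targets; a larger `horizon` produces multistep targets. The weighted windows would then be passed to the convolutional and LSTM layers, which this sketch does not implement.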
