4.7 Article

A scalable approach based on deep learning for big data time series forecasting

Journal

INTEGRATED COMPUTER-AIDED ENGINEERING
Volume 25, Issue 4, Pages 335-348

Publisher

IOS PRESS
DOI: 10.3233/ICA-180580

Keywords

Deep learning; time series forecasting; big data

Funding

  1. Spanish Ministry of Economy and Competitiveness [TIN2014-55894-C2-R, TIN2017-88209-C2-1-R, P12-TIC-1728]
  2. Junta de Andalucia [TIN2014-55894-C2-R, TIN2017-88209-C2-1-R, P12-TIC-1728]

Ask authors/readers for more resources

This paper presents a method based on deep learning to deal with big data times series forecasting. The deep feed forward neural network provided by the H2O big data analysis framework has been used along with the Apache Spark platform for distributed computing. Since H2O does not allow the conduction of multi-step regression, a general-purpose methodology that can be used for prediction horizons with arbitrary length is proposed here, being the prediction horizon, h, the number of future values to be predicted. The solution consists in splitting the problem into h forecasting subproblems, being h the number of samples to be simultaneously predicted. Thus, the best prediction model for each subproblem can be obtained, making easier its parallelization and adaptation to the big data context. Moreover, a grid search is carried out to obtain the optimal hyperparameters of the deep learning-based approach. Results from a real-world dataset composed of electricity consumption in Spain, with a ten-minute frequency sampling rate, from 2007 to 2016 are reported. In particular, the accuracy and runtimes versus computing resources and size of the dataset are analyzed. Finally, the performance and the scalability of the proposed method is compared to other recently published techniques, showing to be a suitable method to process big data time series.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available