Journal
INTEGRATED COMPUTER-AIDED ENGINEERING
Volume 25, Issue 4, Pages 335-348
Publisher
IOS PRESS
DOI: 10.3233/ICA-180580
Keywords
Deep learning; time series forecasting; big data
Funding
- Spanish Ministry of Economy and Competitiveness [TIN2014-55894-C2-R, TIN2017-88209-C2-1-R, P12-TIC-1728]
- Junta de Andalucia [TIN2014-55894-C2-R, TIN2017-88209-C2-1-R, P12-TIC-1728]
This paper presents a deep learning method for big data time series forecasting. The deep feed-forward neural network provided by the H2O big data analysis framework is used together with the Apache Spark platform for distributed computing. Since H2O does not support multi-step regression, a general-purpose methodology is proposed that works for prediction horizons of arbitrary length, where the prediction horizon, h, is the number of future values to be predicted. The solution splits the problem into h forecasting subproblems, one for each of the h values to be predicted. The best prediction model can thus be obtained for each subproblem, which makes the approach easier to parallelize and to adapt to the big data context. Moreover, a grid search is carried out to obtain the optimal hyperparameters of the deep learning-based approach. Results are reported for a real-world dataset of electricity consumption in Spain, sampled every ten minutes from 2007 to 2016. In particular, accuracy and runtime are analyzed as functions of computing resources and dataset size. Finally, the performance and scalability of the proposed method are compared with those of other recently published techniques, showing that it is a suitable method for processing big data time series.
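The decomposition into h subproblems described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the window length `w`, the function names, and the stand-in model (a one-feature least-squares fit on the window mean, used here in place of the H2O deep feed-forward network) are all assumptions made so that the example runs with the Python standard library alone.

```python
from typing import Callable, List, Sequence, Tuple

Model = Callable[[Sequence[float]], float]
TrainingSet = Tuple[List[List[float]], List[float]]


def make_subproblems(series: Sequence[float], w: int, h: int) -> List[TrainingSet]:
    """Build h training sets from one series.

    Training set j pairs every window of w past values with the single
    value that lies j + 1 steps after the end of that window.
    (w is a hypothetical window length; the abstract does not fix it.)
    """
    subproblems: List[TrainingSet] = [([], []) for _ in range(h)]
    for start in range(len(series) - w - h + 1):
        window = list(series[start:start + w])
        for j in range(h):
            features, targets = subproblems[j]
            features.append(window)
            targets.append(series[start + w + j])
    return subproblems


def fit_stand_in(features: List[List[float]], targets: List[float]) -> Model:
    """Stand-in single-output regressor (assumption, not the paper's model):
    ordinary least squares of the target on the mean of the window."""
    means = [sum(row) / len(row) for row in features]
    mean_x = sum(means) / len(means)
    mean_y = sum(targets) / len(targets)
    var_x = sum((m - mean_x) ** 2 for m in means)
    cov_xy = sum((m - mean_x) * (t - mean_y) for m, t in zip(means, targets))
    slope = cov_xy / var_x if var_x else 0.0
    intercept = mean_y - slope * mean_x
    return lambda window: intercept + slope * (sum(window) / len(window))


def fit_multi_step(series: Sequence[float], w: int, h: int) -> List[Model]:
    """Fit one independent model per horizon step."""
    return [fit_stand_in(x, y) for x, y in make_subproblems(series, w, h)]


def forecast(models: List[Model], last_window: Sequence[float]) -> List[float]:
    """Predict the next h values, one per fitted model."""
    return [model(last_window) for model in models]
```

Because each of the h models is trained on its own training set and shares no state with the others, the loop in `fit_multi_step` could be distributed across workers, which is the property the abstract points to when it mentions parallelization.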