4.8 Article

Augmenting energy time-series for data-efficient imputation of missing values

Journal

APPLIED ENERGY
Volume 334, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2023.120701

Keywords

Missing data; Data augmentation; Data scarcity; Building energy data; Deep learning

Ask authors/readers for more resources

This study investigates the use of data augmentation techniques for reconstructing missing energy time-series in limited data scenarios. A convolutional denoising autoencoder is chosen as the base imputation model, and an optimal augmentation rate is determined based on preliminary results. The results show that augmenting a nine days-long training set 80 times can significantly reduce the initial average RMSE and outperform benchmark methods.
This study explores the applicability of data augmentation techniques for reconstructing missing energy time -series in limited data regimes. In particular, multiple synthetic copies of a relatively small training dataset are stacked together with pseudo-random noise. First, an existing convolutional denoising autoencoder is selected from a previous work, as the base imputation model of this study. Then, an optimal augmentation rate, which minimizes the training set of the model, is chosen based on the preliminary results obtained from one building. The results proved that, augmenting 80 times a nine days-long training set could reduce the initial average root mean squared error (RMSE) by 37% and 48%, for continuous and random missing scenarios. Additionally, the augmented model outperformed the benchmark methods with 23% and 12% lower average RMSE. No additional tuning or calibration costs were required for the existing base imputation model. Therefore, the presented data augmentation technique could significantly reduce the expensive computational costs associated with deep learning models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available