4.7 Article

Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning

Journal

ENERGY
Volume 264, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.energy.2022.126209

Keywords

HVAC; Optimal control; Reinforcement learning; Deep Q learning; Prioritized replay; Model -free control

Ask authors/readers for more resources

This paper proposes a model-free optimal control method based on deep reinforcement learning for controlling the heat pump start/stop and room temperature setting in residential buildings, aiming to improve energy efficiency of demand-side. The simulation results show that the method can achieve the highest comprehensive reward by coordinating monitoring data, weather forecasts, and building thermal inertia.
Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving energy efficiency of demand-side. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in residential buildings. The opti-mization goal of this method is to obtain the highest comprehensive reward which considering thermal comfort and energy cost. Firstly, the randomness, learning process, thermal comfort and energy consumption of the model-free controller are systematically investigated by a simulation system based on measured data. The results show that randomness has a significant impact on the initial performance and convergence speed of the model -free controller; The model-free controller has a linear accumulation of comprehensive rewards during the learning process, and the slope of the accumulated comprehensive rewards can be used to determine whether the controller converges; The model-free controller coordinates monitoring data, weather forecasts and building thermal inertia to achieve the highest comprehensive reward. Afterwards, the model-free controller was verified in a nearly zero energy residential building in Beijing, China. The results show that model-free controller im-proves the comprehensive reward by 15.3% compared to rule-based method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available