4.6 Article

Energy-efficient control of thermal comfort in multi-zone residential HVAC via reinforcement learning

Journal

CONNECTION SCIENCE
Volume 34, Issue 1, Pages 2364-2394

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/09540091.2022.2120598

Keywords

Reinforcement learning; deep reinforcement learning; multi-zone residential HVAC; energy consumption; thermal comfort

Funding

  1. Primary Research and Development Plan of China [2020YFC2006602]
  2. National Natural Science Foundation of China [62072324, 61876217, 61876121, 61772357]
  3. University Natural Science Foundation of Jiangsu Province [21KJA520005]
  4. Primary Research and Development Plan of Jiangsu Province [BE2020026]
  5. Natural Science Foundation of Jiangsu Province [BK20190942]

Ask authors/readers for more resources

This paper proposes a deep reinforcement learning-based thermal comfort control method for multi-zone residential HVAC systems. By designing an SVR-DNN model and applying the optimization strategy based on Deep Deterministic Policy Gradient (DDPG), the method minimizes energy consumption while satisfying occupants' thermal comfort.
Energy efficient control of thermal comfort has been already an important part of residential heating, ventilation, and air conditioning (HVAC) systems. However, the optimisation of energy saving control for thermal comfort is not an easy task due to the complex dynamics of HVAC systems, the dynamics of thermal comfort and the trade-off between energy saving and thermal comfort. To solve the above problem, we propose a deep reinforcement learning-based thermal comfort control method in multi-zone residential HVAC. In this paper, firstly we design a SVR-DNN model, consisting of Support Vector Regression and a Deep Neural Network to predict thermal comfort value. Then, we apply Deep Deterministic Policy Gradient (DDPG) based on the output of the SVR-DNN model to achieve an optimal HVAC thermal comfort control strategy. This method can minimise energy consumption while satisfying occupants' thermal comfort. The experimental results show that our method can improve thermal comfort prediction performance by 20.5% compared with DNN; compared with deep Q-network (DQN), energy consumption and thermal comfort violation can be reduced by 3.52% and 64.37% respectively.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available