☆ 4.7 Article

Deep reinforcement learning optimal control strategy for temperature setpoint real-time reset in multi-zone building HVAC system

APPLIED THERMAL ENGINEERING (2022)

期刊

APPLIED THERMAL ENGINEERING

卷 212, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.applthermaleng.2022.118552

关键词

Deep reinforcement learning; Multi-zone building; Optimal control; Temperature setpoint reset; EnergyPlus-Python co-simulation

类别

Thermodynamics Energy & Fuels Engineering, Mechanical Mechanics

资金

Hunan Provincial Innovation Foundation for Postgraduate Studies [QL20210107]
Hunan Provincial Research and Development Plan of Key Areas [2020DK2003]
Hunan Provincial Commercialization and Industrialization Plan of Scientific and Technological Achievements [2020GK2077]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study uses deep Q-learning control strategy to find a proper trade-off between HVAC system energy consumption and indoor temperature in a simulation environment. Through a case study experiment, the effectiveness of the strategy in a multi-zone building VAV system and the appropriate temperature setpoint reset sequence are verified.

Determining a proper trade-off between energy consumption and indoor thermal comfort is important for HVAC system control. Deep Q-learning (DQN) based multi-objective optimal control strategy is designed for temperature setpoint real-time reset to balance the energy consumption and indoor air temperature. In addition, this study develops an EnergyPlus-Python co-simulation testbed to evaluate DQN control strategy in a simulation environment. A case study experiment is conducted to evaluate the performance of DQN control strategy for real-time reset of supply air temperature and chilled supply water temperature setpoint in a multi-zone building VAV system. The developed EnergyPlus-Python co-simulation testbed is used to train and test the DQN control strategy for performance analysis. The applied DQN strategy leans to update control actions (i.e. temperature setpoint) through interaction with the simulation environment. Simulation results show that the DQN control strategy is effective in finding a proper trade-off between the energy consumption of HVAC system and indoor air temperature. Meanwhile, the DQN control strategy can find a proper temperature setpoint reset sequence in smaller training episodes, and the control actions can be stable after ten DQN training episodes. This study provides a preliminary direction of deep reinforcement learning control strategy for temperature setpoint realtime reset in multi-zone building HVAC systems.

Deep reinforcement learning optimal control strategy for temperature setpoint real-time reset in multi-zone building HVAC system

期刊

APPLIED THERMAL ENGINEERING

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Deep reinforcement learning optimal control strategy for temperature setpoint real-time reset in multi-zone building HVAC system

期刊

APPLIED THERMAL ENGINEERING

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文