4.6 Article

A reinforcement learning approach for thermostat setpoint preference learning

期刊

BUILDING SIMULATION
卷 -, 期 -, 页码 -

出版社

TSINGHUA UNIV PRESS
DOI: 10.1007/s12273-023-1056-7

关键词

reinforcement learning; preference learning; occupant-centric controls; smart thermostats; off-policy learning

向作者/读者索取更多资源

Occupant-centric controls (OCC) is an indoor climate control approach that utilizes occupant feedback to operate building energy systems. This paper introduces a new off-policy reinforcement learning (RL) algorithm that imitates occupant behavior by utilizing unsolicited occupant thermostat overrides. Simulation results show that the RL algorithm can rapidly learn occupant preferences and achieve substantial energy savings, although the impact varies depending on occupants' preferences and thermostat use behavior stochasticity.
Occupant-centric controls (OCC) is an indoor climate control approach whereby occupant feedback is used in the sequence of operation of building energy systems. While OCC has been used in a wide range of building applications, an OCC category that has received considerable research interest is learning occupants' thermal preferences through their thermostat interactions and adapting temperature setpoints accordingly. Many recent studies used reinforcement learning (RL) as an agent for OCC to optimize energy use and occupant comfort. These studies depended on predicted mean vote (PMV) models or constant comfort ranges to represent comfort, while only few of them used thermostat interactions. This paper addresses this gap by introducing a new off-policy reinforcement learning (RL) algorithm that imitates the occupant behaviour by utilizing unsolicited occupant thermostat overrides. The algorithm is tested with a number of synthetically generated occupant behaviour models implemented via the Python API of EnergyPlus. The simulation results indicate that the RL algorithm could rapidly learn preferences for all tested occupant behaviour scenarios with minimal exploration events. While substantial energy savings were observed with most occupant scenarios, the impact on the energy savings varied depending on occupants' preferences and thermostat use behaviour stochasticity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据