4.6 Article

Optimal home energy management strategy: A reinforcement learning method with actor-critic using Kronecker-factored trust region

Journal

ELECTRIC POWER SYSTEMS RESEARCH
Volume 212, Issue -, Pages -

Publisher

ELSEVIER SCIENCE SA
DOI: 10.1016/j.epsr.2022.108617

Keywords

Appliance scheduling; Demand response; Deep reinforcement learning; Home energy management; Markov decision process

Funding

  1. Natural Science Foundation of China [52077060]

Ask authors/readers for more resources

This paper proposes an energy scheduling strategy optimization framework for a home energy management system based on photovoltaic and storage, aiming to reduce environmental impact and costs by scheduling household appliances optimally. Utilizing a Markov decision process model to describe uncertainties in energy usage behavior and real-time electricity prices, a model-free energy scheduling approach based on ACKTR is proposed, with high sampling efficiency.
The global development trend of building decentralized and low-carbon energy systems has given rise to challenges in the demand side energy management. With the overwhelming increase in the domestic electricity demand and distributed energy penetration, scheduling residential energy consumption has become essential for decreasing environmental impact. To this end, the paper proposes an energy scheduling strategy optimization framework for a home energy management system devised with photovoltaic and storage to realize the optimal scheduling of household appliances. Uncertainties in energy usage behaviour and real-time electricity prices were considered and described using a Markov decision process model. Subsequently, to search for the optimal appliance scheduling policy in a complex and changing environment, a model-free energy scheduling approach was proposed based on actor-critic using the Kronecker-Factored Trust Region (ACKTR). Unlike conventional deep reinforcement learning methods, the proposed ACKTR-based approach has high sampling efficiency and can deal with both discrete and continuous control actions to jointly optimize the scheduling strategies of various types of appliances. Finally, the numerical results of case studies validate the effectiveness and demonstrate that the proposed approach can significantly reduce costs while maintaining satisfaction, with a remarkable 25.37% cost saving on a typical test day.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available