Journal
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
Volume -, Issue -, Pages -Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2022.3194247
Keywords
Automatic generation control (AGC); grid integration of large-scale electric vehicles and wind power; optimistic initialized double Q (OIDQ) method; reinforcement learning
Categories
Funding
- National Natural Science Foundation of China [51707102]
Ask authors/readers for more resources
This article proposes an improved reinforcement learning algorithm to solve the problem of frequency instability in power systems caused by large-scale electric vehicles and wind power grid connection. The algorithm expands the exploration space using an optimistic initialization principle and integrates double Q-learning to address the over-estimation issue. Simulation results demonstrate that the proposed algorithm obtains the global optimal solution and outperforms other reinforcement learning algorithms in terms of control performance.
In order to solve the problem of frequency instability of power system due to strong random disturbance caused by large-scale electric vehicles and wind power grid connection, an improved reinforcement learning algorithm, namely, optimistic initialized double Q, is proposed in this article from the perspective of automatic generation control. The proposed algorithm uses the optimistic initialization principle to expand the agent action exploration space, so as to prevent Q-learning from falling into local optimum by greedy strategy; meanwhile, it integrates double Q-learning to solve the problem of over-estimation of action value in traditional reinforcement learning based on Q-learning. In the algorithm, the hyperparameter alpha(tau) is introduced to improve the learning efficiency, and the reward b(tau) based on exploration times is introduced to increase the Q value estimation to drive the exploration of the algorithm, so as to obtain the optimal solution. By simulating the twoarea load frequency control model integrated with large-scale electric vehicles and the four-area interconnected power grid model integrated with large-scale wind power generation, it is verified that the proposed algorithm can obtain the global optimal solution, thus effectively solvinng the frequency instability caused by strong random disturbance in the grid-connected mode of large-scale wind power generation, and compared with many reinforcement learning algorithms, the proposed algorithm has better control performance.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available