4.7 Article

Optimal Policy Characterization Enhanced Actor-Critic Approach for Electric Vehicle Charging Scheduling in a Power Distribution Network

期刊

IEEE TRANSACTIONS ON SMART GRID
卷 12, 期 2, 页码 1416-1428

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSG.2020.3028470

关键词

Electric vehicle charging; Optimal scheduling; Stochastic processes; Distribution networks; Reinforcement learning; Solar power generation; Dynamic programming; deep reinforcement learning; electric vehicle charging; actor-critic approach; power distribution network

资金

  1. Shun Hing Institute of Advanced Engineering, the Chinese University of Hong Kong [RNE-p5-19]

向作者/读者索取更多资源

In this study, scheduling large-scale electric vehicle charging in a power distribution network under random renewable generation and electricity prices is explored. The nodal multi-target (NMT) characterization of the optimal scheduling policy is established to reduce the dimensionality of neural network outputs without compromising optimality. The proposed SAC + NMT approach outperforms existing deep reinforcement learning methods in numerical experiments on the IEEE 37-node test feeder.
We study the scheduling of large-scale electric vehicle (EV) charging in a power distribution network under random renewable generation and electricity prices. The problem is formulated as a stochastic dynamic program with unknown state transition probability. To mitigate the curse of dimensionality, we establish the nodal multi-target (NMT) characterization of the optimal scheduling policy: all EVs with the same deadline at the same bus should be charged to approach a single target of remaining energy demand. We prove that the NMT characterization is optimal under arbitrarily random system dynamics. To adaptively learn the dynamics of system uncertainty, we propose a model-free soft-actor-critic (SAC) based method to determine the target levels for the characterized NMT policy. The proposed SAC + NMT approach significantly outperforms existing deep reinforcement learning methods (in our numerical experiments on the IEEE 37-node test feeder), as the established NMT characterization sharply reduces the dimensionality of neural network outputs without loss of optimality.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据