4.6 Article

Multi-Microgrid Energy Management Strategy Based on Multi-Agent Deep Reinforcement Learning with Prioritized Experience Replay

期刊

APPLIED SCIENCES-BASEL
卷 13, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/app13052865

关键词

multi-microgrid; multi-agent; energy management; reinforcement learning

向作者/读者索取更多资源

This paper proposes a multi-agent reinforcement learning algorithm for real-time energy management of a multi-microgrid (MMG) system. The MMG is connected to a distribution network, and the distribution network operator and each microgrid are modeled as autonomous agents. Each agent makes decisions based on local information, and the problem is modeled as a Markov game and solved using the prioritized multi-agent deep deterministic policy gradient (PMADDPG) algorithm.
The multi-microgrid (MMG) system has attracted more and more attention due to its low carbon emissions and flexibility. This paper proposes a multi-agent reinforcement learning algorithm for real-time energy management of an MMG. In this problem, the MMG is connected to a distribution network (DN). The distribution network operator (DSO) and each microgrid (MG) are modeled as autonomous agents. Each agent makes decisions to suit its interests based on local information. The decision-making problem of multiple agents is modeled as a Markov game and solved by the prioritized multi-agent deep deterministic policy gradient (PMADDPG), where only local observation is required for each agent to make decisions, the centralized training mechanism is applied to learn coordination strategy, and a prioritized experience replay (PER) strategy is adopted to improve learning efficiency. The proposed method can deal with the non-stationary problems in the process of a multi-agent game with partial observable information. In the execution stage, all trained agents are deployed in a distributed manner and make decisions in real time. Simulation results show that according to the proposed method, the training process of a multi-agent game is accelerated, and multiple agents can make optimal decisions only by local information.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据