4.6 Article

Multi-agent deep reinforcement learning-based optimal energy management for grid-connected multiple energy carrier microgrids

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.ijepes.2023.109292

关键词

Multi-agent deep reinforcement learning; Smart grid; Multiple energy carrier microgrids; System optimization

向作者/读者索取更多资源

This paper introduces a novel optimal energy management method based on the multi-agent deep reinforcement learning (MA-DRL) approach. The method utilizes deep neural networks and stacked denoising auto-encoders to learn strategies, and employs multi-agent deep deterministic policy gradient learning capability. The MA-DRL method is utilized to find the optimal strategy for managing energy resources under the Markov decision process framework, taking into account the distinct properties of electric and thermal energies.
Multi-agent deep reinforcement learning (MA-DRL) method provides a groundbreaking approach to tackling computational problems in power systems, particularly for distributed energy resources that have been widely adopted to advance energy sustainability. This paper presents a novel optimal energy management based on proposed MA-DRL method. This method employs deep neural network to learn strategy based on stackeddenoising auto-encoders and multi-agent deep deterministic policy gradient learning capability. The MA-DRL method is adopted to find the optimal strategy of the optimal energy management problem under the Markov decision process framework. This method aims to coordinate multiple energies and achieve optimal operation over a variety of hourly dispatches while taking into account the distinct properties of electric and thermal energies. The primary challenge of the planning and operation of multiple energy carrier microgrids (MECMs) is determining the optimal interaction between renewable energy resources, energy storage systems, power-to thermal conversion systems, and upstream power grid in order to improve overall energy utilization efficiency. The presented robust method can adaptively derive the optimal operation for MECMs through centralized learning and decentralized implementation. The optimization problem is employed in this study to concurrently reduce the total emissions and the operating costs while considering engineering design constraints. Finally, to demonstrate the efficiency of the proposed method, it is verified on an integrated modified IEEE 33-bus and 8 node gas systems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据