4.7 Article

Dynamic energy dispatch strategy for integrated energy system based on improved deep reinforcement learning

期刊

ENERGY
卷 235, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.energy.2021.121377

关键词

Dynamic energy dispatch; Integrated energy system; Deep reinforcement learning; Improved deep deterministic policy gradient; Uncertainties

资金

  1. National Key Research and Development Program of China [2017YFE0132100]
  2. National Natural Science Foundation of China [61971305]
  3. Key Research and Development Program of Tianjin [20YFYSGX00060]

向作者/读者索取更多资源

This study introduces a novel model-free dynamic dispatch strategy for integrated energy systems based on improved deep reinforcement learning, which can adapt to the stochastic fluctuations of renewable energy generation and energy demands to optimize the operational efficiency of the system.
Dynamic energy dispatch is an integral part of the operation optimization of integrated energy systems (IESs). Most existing dynamic dispatch schemes depend heavily on explicit forecast or mathematical models of the future uncertainties. Due to the randomness of renewable energy generation and energy demands, these approaches are limited by the accuracy of forecasting or model. A novel model-free dynamic dispatch strategy for IES based on improved deep reinforcement learning (DRL) is proposed to solve the problem. The IES dynamic dispatch problem is formulated as a Markov decision process (MDP), in which the uncertainties of renewable generation, electric load and heat load are considered. For solving the MDP, an improved deep deterministic policy gradient (DDPG) algorithm using prioritized experience replay mechanism and L-2 regularization is developed, so as to improve the policy quality and learning efficiency of the dispatch strategy. The proposed approach does not require any forecast information or distribution knowledge, and can adaptively respond to the stochastic fluctuations of the supply and demands. Simulation results show the proposed dispatch strategy has faster convergence and lower operating costs than original DDPG-based strategy. In addition, the advantages of the proposed approach in terms of cost-effectiveness and stochastic environmental adaptation are validated. (C) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据