4.7 Article

Optimization of the Operation and Maintenance of renewable energy systems by Deep Reinforcement Learning

期刊

RENEWABLE ENERGY
卷 183, 期 -, 页码 752-763

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.renene.2021.11.052

关键词

Renewable energy systems; Wind farm; Operation and maintenance; Prognostics and health management; Optimization; Deep reinforcement learning

向作者/读者索取更多资源

This study utilizes Prognostics & Health Management (PHM) capabilities to support equipment of renewable energy systems, estimating their health state and predicting Remaining Useful Life (RUL). The authors propose a novel formulation of the optimization problem and solve it using Deep Reinforcement Learning (DRL). The approach, tested in a wind farm O&M problem, outperforms other DRL algorithms and automatically discovers the best performing policy.
Equipment of renewable energy systems are being supported by Prognostics & Health Management (PHM) capabilities to estimate their current health state and predict their Remaining Useful Life (RUL). The PHM health state estimates and RUL predictions can be used for the optimization of the systems Operation and Maintenance (O&M). This is an ambitious and challenging task, which requires to consider many factors, including the availability of maintenance crews, the variability of energy demand and production, the influence of the operating conditions on equipment performance and degradation and the long time horizons of renewable energy systems usage. We develop a novel formulation of the O&M optimization as a sequential decision problem and we resort to Deep Reinforcement Learning (DRL) to solve it. The proposed solution approach combines proximal policy optimization, imitation learning, for pre-training the learning agent, and a model of the environment which describes the renewable energy system behavior. The solution approach is tested by its application to a wind farm O&M problem. The optimal solution found is shown to outperform those provided by other DRL algorithms. Also, the approach does not require to select a-priori a maintenance strategy, but, rather, it discovers the best performing policy by itself.(c) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据