☆ 4.7 Article

A sequential decision problem formulation and deep reinforcement learning solution of the optimization of O&M of cyber-physical energy systems (CPESs) for reliable and safe power production and supply

RELIABILITY ENGINEERING & SYSTEM SAFETY (2023)

期刊

RELIABILITY ENGINEERING & SYSTEM SAFETY

卷 235, 期 -, 页码 -

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.ress.2023.109231

关键词

Cyber-Physical Energy System (CPES); Operation & Maintenance (O & M); Deep Reinforcement Learning (DRL); Nuclear Power Plant (NPP); Optimization; Advanced Lead -cooled Fast Reactor European; Demonstrator (ALFRED)

类别

Engineering, Industrial Operations Research & Management Science

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper discusses the O&M strategies for reliable and safe production and supply of CPESs, considering the uncertainty in energy demand and supply due to renewable energy sources and the need to avoid severe accidents for safety reasons. A Deep Reinforcement Learning approach is developed to search for the best strategy, taking into account the health conditions and remaining useful life of system components, and possible accident scenarios. The approach integrates Proximal Policy Optimization and Imitation Learning, and incorporates a CPES model with component RUL estimator and failure process model. An application to the ALFRED reactor demonstrates that the optimal solution found by DRL outperforms state-of-the-art O&M policies.

The Operation & Maintenance (O&M) of Cyber-Physical Energy Systems (CPESs) is driven by reliable and safe production and supply, that need to account for flexibility to respond to the uncertainty in energy demand and also supply due to the stochasticity of Renewable Energy Sources (RESs); at the same time, accidents of severe consequences must be avoided for safety reasons. In this paper, we consider O&M strategies for CPES reliable and safe production and supply, and develop a Deep Reinforcement Learning (DRL) approach to search for the best strategy, considering the system components health conditions, their Remaining Useful Life (RUL), and possible accident scenarios. The approach integrates Proximal Policy Optimization (PPO) and Imitation Learning (IL) for training RL agent, with a CPES model that embeds the components RUL estimator and their failure process model. The novelty of the work lies in i) taking production plan into O&M decisions to implement maintenance and operate flexibly; ii) embedding the reliability model into CPES model to recognize safety related components and set proper maintenance RUL thresholds. An application, the Advanced Lead-cooled Fast Reactor European Demonstrator (ALFRED), is provided. The optimal solution found by DRL is shown to outperform those provided by state-of-the-art O&M policies.

A sequential decision problem formulation and deep reinforcement learning solution of the optimization of O&M of cyber-physical energy systems (CPESs) for reliable and safe power production and supply

期刊

RELIABILITY ENGINEERING & SYSTEM SAFETY

出版社

ELSEVIER SCI LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A sequential decision problem formulation and deep reinforcement learning solution of the optimization of O&M of cyber-physical energy systems (CPESs) for reliable and safe power production and supply

期刊

RELIABILITY ENGINEERING & SYSTEM SAFETY

出版社

ELSEVIER SCI LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文