4.7 Article

Multi-agent deep reinforcement learning based decision support model for resilient community post-hazard recovery

Journal

RELIABILITY ENGINEERING & SYSTEM SAFETY
Volume 242, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ress.2023.109754

Keywords

Disaster resilience; Multi-agent reinforcement learning; Decision support model; Interdependent infrastructure systems; Post -hazard recovery

Ask authors/readers for more resources

This study proposes a novel decision support model that integrates graph theory and neural network to determine optimal restoration policies for maximizing disaster resilience. The model utilizes stochastic scheduling and deep reinforcement learning algorithm to efficiently make repair decisions, and demonstrates higher performance and computational efficiency in a case study.
After a city-scale natural hazard, policymakers should plan sound decisions on the repair sequence to ensure the resilient recovery of the community, which consists of interdependent infrastructures. Stochastic scheduling for repairing interdependent infrastructure systems is a difficult control problem with huge decision spaces. This study proposes a novel decision support model to determine the optimal restoration policies for the purpose of maximizing disaster resilience. A simulation environment is first developed, consisting of hazard intensity assessment, components damage evaluation, system recovery simulation, and resilience quantification. The graph theory is utilized to represent the interdependencies among different systems, and the heterogeneous graph neural network is integrated into this framework to extract the topology and interdependency information of the whole community. The optimal repair policies approximated by neural networks are trained by a multi-agent deep reinforcement learning algorithm, considering uncertainties of the restoration process. The superiority and efficiency of the proposed method are demonstrated through a case study of the Tsinghua University campus, where different decision-making objectives are considered. The results show that the recovery trajectories determined by the proposed model have the highest performance compared to conventional methods. Besides, the proposed methodology based on transfer learning can achieve high computational efficiency for new damage scenarios. This model is promising to be a high-performance, robust decision-support tool for post -hazard repairing decisions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available