4.3 Article

Dynamic maintenance model for a repairable multi-component system using deep reinforcement learning

Journal

QUALITY ENGINEERING
Volume 34, Issue 1, Pages 16-35

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/08982112.2021.1977950

Keywords

Deep reinforcement learning; Deep Q network; dynamic maintenance; gamma process; Markov decision process

Funding

  1. National Natural Science Foundation of China [71731008]

Ask authors/readers for more resources

In this paper, a new dynamic maintenance model for a degrading repairable system is provided using deep reinforcement learning method. The system state is the degradation level, and the maintenance problem is solved using Deep Q learning algorithm, avoiding discretization of the system states.
Using artificial intelligence for maintenance planning is useful for many industries to have a smart decision-making tool that delivers the best maintenance policy to minimize the expected maintenance costs. In this paper, a deep reinforcement learning method is used to provide a new dynamic maintenance model for a degrading repairable system subject to degradation and random shock. At any time, the degradation level of the system can be considered as the state of the system, and based on the available actions, it transits to different levels. The gamma process is used to formulate the degradation form of the system. The maintenance problem is formulated as a Markov decision process, and Deep Q learning algorithm is used to solve the problem. For most of the models in the literature, the degradation state of the system must be discretized. However, discretization of the degradation states brings inaccuracy and inefficiency to the model. In this paper, instead of discretizing the degradation state, we consider the exact level of degradation as the state of the system. The Deep Q learning method tries to recognize patterns instead of mapping every state to its best action. A neural network is trained during the learning process of the algorithm, and it can be used as a decision-making tool for the maintenance team to find the best maintenance action based on the current degradation level of the system. A numerical example illustrates how the deep reinforcement learning algorithm can be applied to find the optimal maintenance action at each degradation level.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available