4.7 Article

Multi-label fault recognition framework using deep reinforcement learning and curriculum learning mechanism

Journal

ADVANCED ENGINEERING INFORMATICS
Volume 54, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.aei.2022.101773

Keywords

Fault recognition; Deep reinforcement learning; Multi-label learning; Curriculum learning mechanism; Proximal policy optimization

Funding

  1. National Natural Science Founda-tion of China
  2. National Key R&D Program of China
  3. [52175094]
  4. [2020YFB2007700]

Ask authors/readers for more resources

This paper proposes an improved deep reinforcement learning method for multi-label classification. The method achieves higher accuracy in fault recognition tasks under complex working conditions. By designing an action history vector, the method establishes relationships between predicted labels of input samples and utilizes the curriculum learning mechanism to distinguish labels from easy to complex.
In the actual working site, the equipment often works in different working conditions while the manufacturing system is rather complicated. However, traditional multi-label learning methods need to use the pre-defined label sequence or synchronously predict all labels of the input sample in the fault diagnosis domain. Deep reinforcement learning (DRL) combines the perception ability of deep learning and the decision-making ability of reinforcement learning. Moreover, the curriculum learning mechanism follows the learning approach of humans from easy to complex. Consequently, an improved proximal policy optimization (PPO) method, which is a typical algorithm in DRL, is proposed as a novel method on multi-label classification in this paper. The improved PPO method could build a relationship between several predicted labels of input sample because of designing an action history vector, which encodes all history actions selected by the agent at current time step. In two rolling bearing experiments, the diagnostic results demonstrate that the proposed method provides a higher accuracy than traditional multi-label methods on fault recognition under complicated working conditions. Besides, the proposed method could distinguish the multiple labels of input samples following the curriculum mechanism from easy to complex, compared with the same network using the pre-defined label sequence.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available