4.7 Article

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

Journal

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.rcim.2022.102412

Keywords

Flexiblejob-shopschedulingproblem; Smartmanufacturing; Multi-agentmanufacturingsystem; Reinforcementlearning; Proximalpolicyoptimization

Funding

  1. National Key Research and Develop- ment Program of China [2018YFE0177000]
  2. National Natural Science Foundation of China [52075257]
  3. Fundamental Research Funds for the Central Universities [NT2021021]

Ask authors/readers for more resources

This paper proposes a multi-agent manufacturing system based on deep reinforcement learning to address the production planning and control problems caused by personalized orders. The system, with the cooperation and competition among multiple equipment agents and the decision-making module of AI scheduler, efficiently performs task allocation and continuously improves decision-making performance through training and updating.
Personalized orders bring challenges to the production paradigm, and there is an urgent need for the dynamic responsiveness and self-adjustment ability of the workshop. Traditional dispatching rules and heuristic algo-rithms solve the production planning and control problems by making schedules. However, the previous methods cannot work well in a changeable workshop environment when encountering a large number of stochastic disturbances of orders and resources. Recently, the potential of artificial intelligence (AI) algorithms in solving the dynamic scheduling problem has attracted researchers' attention. Therefore, this paper presents a multi -agent manufacturing system based on deep reinforcement learning (DRL), which integrates the self -organization mechanism and self-learning strategy. Firstly, the manufacturing equipment in the workshop is constructed as an equipment agent with the support of edge computing node, and an improved contract network protocol (CNP) is applied to guide the cooperation and competition among multiple agents, so as to complete personalized orders efficiently. Secondly, a multi-layer perceptron is employed to establish the decision-making module called AI scheduler inside the equipment agent. According to the perceived workshop state information, AI scheduler intelligently generates an optimal production strategy to perform task allocation. Then, based on the collected sample trajectories of scheduling process, AI scheduler is periodically trained and updated through the proximal policy optimization (PPO) algorithm to improve its decision-making performance. Finally, in the multi -agent manufacturing system testbed, dynamic events such as stochastic job insertions and unpredictable machine failures are considered in the verification experiments. The experimental results show that the proposed method is capable of obtaining the scheduling solutions that meet various performance metrics, as well as dealing with resource or task disturbances efficiently and autonomously.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available