☆ 4.7 Article

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING (2022)

Journal

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING

Volume 78, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.rcim.2022.102412

Keywords

Flexiblejob-shopschedulingproblem; Smartmanufacturing; Multi-agentmanufacturingsystem; Reinforcementlearning; Proximalpolicyoptimization

Funding

National Key Research and Develop- ment Program of China [2018YFE0177000]
National Natural Science Foundation of China [52075257]
Fundamental Research Funds for the Central Universities [NT2021021]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper proposes a multi-agent manufacturing system based on deep reinforcement learning to address the production planning and control problems caused by personalized orders. The system, with the cooperation and competition among multiple equipment agents and the decision-making module of AI scheduler, efficiently performs task allocation and continuously improves decision-making performance through training and updating.

Personalized orders bring challenges to the production paradigm, and there is an urgent need for the dynamic responsiveness and self-adjustment ability of the workshop. Traditional dispatching rules and heuristic algo-rithms solve the production planning and control problems by making schedules. However, the previous methods cannot work well in a changeable workshop environment when encountering a large number of stochastic disturbances of orders and resources. Recently, the potential of artificial intelligence (AI) algorithms in solving the dynamic scheduling problem has attracted researchers' attention. Therefore, this paper presents a multi -agent manufacturing system based on deep reinforcement learning (DRL), which integrates the self -organization mechanism and self-learning strategy. Firstly, the manufacturing equipment in the workshop is constructed as an equipment agent with the support of edge computing node, and an improved contract network protocol (CNP) is applied to guide the cooperation and competition among multiple agents, so as to complete personalized orders efficiently. Secondly, a multi-layer perceptron is employed to establish the decision-making module called AI scheduler inside the equipment agent. According to the perceived workshop state information, AI scheduler intelligently generates an optimal production strategy to perform task allocation. Then, based on the collected sample trajectories of scheduling process, AI scheduler is periodically trained and updated through the proximal policy optimization (PPO) algorithm to improve its decision-making performance. Finally, in the multi -agent manufacturing system testbed, dynamic events such as stochastic job insertions and unpredictable machine failures are considered in the verification experiments. The experimental results show that the proposed method is capable of obtaining the scheduling solutions that meet various performance metrics, as well as dealing with resource or task disturbances efficiently and autonomously.

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

Journal

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

Journal

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper