4.7 Article

Real-time scheduling for distributed permutation flowshops with dynamic job arrivals using deep reinforcement learning

期刊

ADVANCED ENGINEERING INFORMATICS
卷 54, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.aei.2022.101776

关键词

Distributed flowshop scheduling; Deep reinforcement learning; Real-time scheduling; Dynamic job arrivals; Intelligent scheduling; Deep Q -network

资金

  1. National Defense Basic Scientific Research Program of China
  2. [JCKY2021208B003]

向作者/读者索取更多资源

This study proposes a solution to the distributed permutation flowshop scheduling problem using deep reinforcement learning. By designing suitable reward function and scheduling actions, an intelligent scheduling agent is trained, which significantly improves the solution quality and computation efficiency.
Distributed manufacturing plays an important role for large-scale companies to reduce production and trans-portation costs for globalized orders. However, how to real-timely and properly assign dynamic orders to distributed workshops is a challenging problem. To provide real-time and intelligent decision-making of scheduling for distributed flowshops, we studied the distributed permutation flowshop scheduling problem (DPFSP) with dynamic job arrivals using deep reinforcement learning (DRL). The objective is to minimize the total tardiness cost of all jobs. We provided the training and execution procedures of intelligent scheduling based on DRL for the dynamic DPFSP. In addition, we established a DRL-based scheduling model for distributed flowshops by designing suitable reward function, scheduling actions, and state features. A novel reward function is designed to directly relate to the objective. Various problem-specific dispatching rules are introduced to provide efficient actions for different production states. Furthermore, four efficient DRL algorithms, including deep Q-network (DQN), double DQN (DbDQN), dueling DQN (DlDQN), and advantage actor-critic (A2C), are adapted to train the scheduling agent. The training curves show that the agent learned to generate better so-lutions effectively and validate that the system design is reasonable. After training, all DRL algorithms outper-form traditional meta-heuristics and well-known priority dispatching rules (PDRs) by a large margin in terms of solution quality and computation efficiency. This work shows the effectiveness of DRL for the real-time sched-uling of dynamic DPFSP.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据