4.7 Article

A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations

期刊

JOURNAL OF MANUFACTURING SYSTEMS
卷 63, 期 -, 页码 491-503

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.jmsy.2022.05.006

关键词

Human-robot coexisting; Part-behavior assembly and; or graph; Behavior prediction; Self-attention; Adaptive decision making; Reinforcement learning

资金

  1. National Key Research and Development Plan of China [2019YFB1706300]
  2. Fundamental Research Funds for the Central Universities [2232019D3-32]
  3. Shanghai Sailing Program [19YF1401600]

向作者/读者索取更多资源

This research presents a method for human-robot collaborative assembly, representing the assembly task of complex products using part-behavior assembly and/or graph based on process requirements. In dynamic scenes, the combination of a human behavior prediction network based on self-attention and the robustness of Soft Actor-Critic algorithm enhances the robot's autonomous decision-making ability. Experimental analysis verifies the effectiveness of the method and demonstrates the feasibility of reinforcement learning for adaptive decision-making in human-robot collaboration environments.
In today's prevailing manufacturing paradigm of mass personalization, neither human operators nor robots alone can perform all assembly tasks efficiently. To overcome it, human-robot collaborative assembly shows its great potentials to ensure the flexibility of human operations with high reliability of robot assistance. However, it is often challenging to achieve harmonious coexistence between humans and robots to complete the tasks safely and efficiently. In this regard, this research provides a detailed description of the human-robot coexisting environment and further introduces key issues in collaborative assembly. A part-behavior assembly and/or graph based on process requirements is proposed to represent the assembly task of complex products. Moreover, the human behavior prediction network based on self-attention can achieve higher accuracy. Combined with the robustness of Soft Actor-Critic (SAC), the collaborative system improves the self-decision ability of the robot in the dynamic scene. Finally, the effectiveness of the method is verified through experimental analysis. The results indicate that the accuracy of the proposed behavior recognition based on self-attention method is 91%. At the same time, it is proved that the reinforcement learning method is theoretically feasible to provide adaptive decision-making for robots in human-machine collaboration. In addition, the convergence speed of the reward function proves the feasibility of SAC for adaptive decision-making in a human-robot collaborative environment.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据