4.7 Article

A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations

Journal

JOURNAL OF MANUFACTURING SYSTEMS
Volume 63, Issue -, Pages 491-503

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.jmsy.2022.05.006

Keywords

Human-robot coexisting; Part-behavior assembly and; or graph; Behavior prediction; Self-attention; Adaptive decision making; Reinforcement learning

Funding

  1. National Key Research and Development Plan of China [2019YFB1706300]
  2. Fundamental Research Funds for the Central Universities [2232019D3-32]
  3. Shanghai Sailing Program [19YF1401600]

Ask authors/readers for more resources

This research presents a method for human-robot collaborative assembly, representing the assembly task of complex products using part-behavior assembly and/or graph based on process requirements. In dynamic scenes, the combination of a human behavior prediction network based on self-attention and the robustness of Soft Actor-Critic algorithm enhances the robot's autonomous decision-making ability. Experimental analysis verifies the effectiveness of the method and demonstrates the feasibility of reinforcement learning for adaptive decision-making in human-robot collaboration environments.
In today's prevailing manufacturing paradigm of mass personalization, neither human operators nor robots alone can perform all assembly tasks efficiently. To overcome it, human-robot collaborative assembly shows its great potentials to ensure the flexibility of human operations with high reliability of robot assistance. However, it is often challenging to achieve harmonious coexistence between humans and robots to complete the tasks safely and efficiently. In this regard, this research provides a detailed description of the human-robot coexisting environment and further introduces key issues in collaborative assembly. A part-behavior assembly and/or graph based on process requirements is proposed to represent the assembly task of complex products. Moreover, the human behavior prediction network based on self-attention can achieve higher accuracy. Combined with the robustness of Soft Actor-Critic (SAC), the collaborative system improves the self-decision ability of the robot in the dynamic scene. Finally, the effectiveness of the method is verified through experimental analysis. The results indicate that the accuracy of the proposed behavior recognition based on self-attention method is 91%. At the same time, it is proved that the reinforcement learning method is theoretically feasible to provide adaptive decision-making for robots in human-machine collaboration. In addition, the convergence speed of the reward function proves the feasibility of SAC for adaptive decision-making in a human-robot collaborative environment.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available