4.6 Article

Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor-Critic with Hindsight Experience Replay

Journal

SENSORS
Volume 20, Issue 20, Pages -

Publisher

MDPI
DOI: 10.3390/s20205911

Keywords

path planning; multi-arm manipulators; reinforcement learning; soft actor-critic (SAC); hindsight experience replay (HER); collision avoidance

Funding

  1. Ministry of Trade, Industry & Energy(MOTIE, Korea) [20005024]
  2. National Research Foundation of Korea(NRF) - Ministry of Education [NRF-2019R1A6A1A03032119]
  3. Korea Evaluation Institute of Industrial Technology (KEIT) [20005024] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Since path planning for multi-arm manipulators is a complicated high-dimensional problem, effective and fast path generation is not easy for the arbitrarily given start and goal locations of the end effector. Especially, when it comes to deep reinforcement learning-based path planning, high-dimensionality makes it difficult for existing reinforcement learning-based methods to have efficient exploration which is crucial for successful training. The recently proposed soft actor-critic (SAC) is well known to have good exploration ability due to the use of the entropy term in the objective function. Motivated by this, in this paper, a SAC-based path planning algorithm is proposed. The hindsight experience replay (HER) is also employed for sample efficiency and configuration space augmentation is used in order to deal with complicated configuration space of the multi-arms. To show the effectiveness of the proposed algorithm, both simulation and experiment results are given. By comparing with existing results, it is demonstrated that the proposed method outperforms the existing results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available