4.4 Article

A new path plan method based on hybrid algorithm of reinforcement learning and particle swarm optimization

期刊

ENGINEERING COMPUTATIONS
卷 39, 期 3, 页码 993-1019

出版社

EMERALD GROUP PUBLISHING LTD
DOI: 10.1108/EC-09-2020-0500

关键词

Reinforcement learning; Path planning; Intelligent driving; Shortest path algorithm

资金

  1. National Natural Science Foundation of China [61571328]
  2. Tianjin Key Natural Science Foundation Project [18JCZDJC96800]
  3. Tianjin Science and Technology Major Project [15ZXDSGX 00050]
  4. Tianjin Science and Technology Innovation Team Fund Projects [TD12-5016, TD13-5025, TD2015-23]
  5. Tianjin Science and Technology Service Industry Major Science and Technology Project [16ZXFWGX00010, 17YFZCGX00360]

向作者/读者索取更多资源

The hybrid path planning algorithm in this paper combines optimized reinforcement learning and improved particle swarm optimization to achieve efficient path planning results. By optimizing RL hyperparameters, designing a pre-set operation for PSO, and proposing a correction variable, the algorithm selects the optimal path effectively.
Purpose To solve the path planning problem of the intelligent driving vehicular, this paper designs a hybrid path planning algorithm based on optimized reinforcement learning (RL) and improved particle swarm optimization (PSO). Design/methodology/approach First, the authors optimized the hyper-parameters of RL to make it converge quickly and learn more efficiently. Then the authors designed a pre-set operation for PSO to reduce the calculation of invalid particles. Finally, the authors proposed a correction variable that can be obtained from the cumulative reward of RL; this revises the fitness of the individual optimal particle and global optimal position of PSO to achieve an efficient path planning result. The authors also designed a selection parameter system to help to select the optimal path. Findings Simulation analysis and experimental test results proved that the proposed algorithm has advantages in terms of practicability and efficiency. This research also foreshadows the research prospects of RL in path planning, which is also the authors' next research direction. Originality/value The authors designed a pre-set operation to reduce the participation of invalid particles in the calculation in PSO. And then, the authors designed a method to optimize hyper-parameters to improve learning efficiency of RL. And then they used RL trained PSO to plan path. The authors also proposed an optimal path evaluation system. This research also foreshadows the research prospects of RL in path planning, which is also the authors' next research direction.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据