4.7 Article

A Discrete JAYA Algorithm Based on Reinforcement Learning and Simulated Annealing for the Traveling Salesman Problem

期刊

MATHEMATICS
卷 11, 期 14, 页码 -

出版社

MDPI
DOI: 10.3390/math11143221

关键词

JAYA algorithm; traveling salesman problem; population-based meta-heuristics; reinforcement learning; metropolis acceptance criterion

向作者/读者索取更多资源

The study proposes an improved discrete JAYA algorithm, called QSA-DJAYA, based on reinforcement learning and simulated annealing, for solving the traveling salesman problem in combinatorial optimization. The algorithm embeds the Q-learning algorithm into JAYA algorithm to update the solution by selecting the most promising transformation operator. It also introduces the Metropolis acceptance criterion from simulated annealing to balance exploration and exploitation, and applies 3-opt at certain frequency to improve efficiency. Experimental results show that the QSA-DJAYA algorithm achieves significantly better results compared to other competitive algorithms in most instances.
The JAYA algorithm is a population-based meta-heuristic algorithm proposed in recent years which has been proved to be suitable for solving global optimization and engineering optimization problems because of its simplicity, easy implementation, and guiding characteristic of striving for the best and avoiding the worst. In this study, an improved discrete JAYA algorithm based on reinforcement learning and simulated annealing (QSA-DJAYA) is proposed to solve the well-known traveling salesman problem in combinatorial optimization. More specially, firstly, the basic Q-learning algorithm in reinforcement learning is embedded into the proposed algorithm such that it can choose the most promising transformation operator for the current state to update the solution. Secondly, in order to balance the exploration and exploitation capabilities of the QSA-DJAYA algorithm, the Metropolis acceptance criterion of the simulated annealing algorithm is introduced to determine whether to accept candidate solutions. Thirdly, 3-opt is applied to the best solution of the current iteration at a certain frequency to improve the efficiency of the algorithm. Finally, to evaluate the performance of the QSA-DJAYA algorithm, it has been tested on 21 benchmark datasets taken from TSPLIB and compared with other competitive algorithms in two groups of comparative experiments. The experimental and the statistical significance test results show that the QSA-DJAYA algorithm achieves significantly better results in most instances.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据