4.5 Article

Discrete space reinforcement learning algorithm based on twin support vector machine classification

期刊

PATTERN RECOGNITION LETTERS
卷 164, 期 -, 页码 254-260

出版社

ELSEVIER
DOI: 10.1016/j.patrec.2022.11.017

关键词

Twin support vector machines; Actor-Critic; Reinforcement learning; Small-scale discrete space environment

资金

  1. National Natural Science Foundation of China (NSFC) [12001450, 20212023, UJA13MZ]

向作者/读者索取更多资源

Reinforcement learning is an important component of machine learning algorithms, but traditional algorithms have convergence speed and accuracy issues in small-scale discrete space environments. Recent research has proposed an improved algorithm based on support vector machines, which utilizes the advantages of twin support vector machines to enhance convergence speed and accuracy.
Reinforcement learning (RL) has become one of the key component of various machine learning algorithms in recent years. However, traditional RL algorithms lack convergence speed and accuracy in small-scale discrete space environment. Recently An et al. proposed RL algorithm based on support vector machines (SVMs) (Pattern Recognit. Lett. 111 (2018) 30-35) which adopts the Advantage Actor-Critic (A2C) framework and improves the speed and accuracy of convergence in discrete space. Owing to the advantages of twin support vector machines (TWSVMs) over SVMs, in this paper, we propose a RL algorithm based on TWSVM classification. The proposed algorithm adopts a modified A2C framework, where there are multiple Actors and a single Critic. Finally, we compare the performance of the proposed algorithm with some existing algorithms in traditional RL environment. Interestingly, the proposed algorithm outperforms the existing algorithms in terms of convergence speed and accuracy. (c) 2022 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据