4.7 Article

Error controlled actor-critic

Journal

INFORMATION SCIENCES
Volume 612, Issue -, Pages 62-74

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.08.079

Keywords

Reinforcement learning; Actor-critic; Approximation error; Overestimation; KL-divergence; Reinforcement learning; Actor-critic; Approximation error; Overestimation; KL-divergence

Funding

  1. Natural Science Foundation of Fujian Province of China [2021J01002]
  2. High- level Talent Project of Xiamen University of Technology [YKJ22028R]

Ask authors/readers for more resources

This article discusses the negative effects of the approximation inaccuracy in reinforcement learning algorithms on algorithm convergence. To mitigate the impact of approximation error, the error controlled actor-critic (ECAC) approach is proposed, which limits the approximation error within the value function. Experimental results demonstrate that the proposed actor-critic approach reduces approximation error and outperforms previous model-free RL algorithms.
The approximation inaccuracy of the value function in reinforcement learning (RL) algo-rithms unavoidably leads to an overestimation phenomenon, which has negative effects on the convergence of the algorithms. To limit the negative effects of the approximation error, we propose error controlled actor-critic (ECAC) which ensures the approximation error is limited within the value function. We present an investigation of how approxima-tion inaccuracy can impair the optimization process of actor-critic approaches. In addition, we derive an upper bound for the approximation error of the Q function approximator and discover that the error can be reduced by limiting the KL-divergence between every two consecutive policies during policy training. Experiments on a variety of continuous control tasks demonstrate that the proposed actor-critic approach decreases approximation error and outperforms previous model-free RL algorithms by a significant margin.(c) 2022 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available