Article

Model-based reinforcement learning with dimension reduction

Journal

NEURAL NETWORKS
Volume 84, Pages 1-16

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2016.08.005

Keywords

Model-based reinforcement learning; Transition model estimation; Sufficient dimension reduction

Funding

  1. KAKENHI [23120004, 25700022]
  2. NEDO [15101157-0]
  3. Grants-in-Aid for Scientific Research [16J08434] Funding Source: KAKEN

The goal of reinforcement learning is to learn an optimal policy that controls an agent so as to maximize the cumulative reward. The model-based reinforcement learning approach learns a transition model of the environment from data and then derives the optimal policy using that model. However, learning an accurate transition model in high-dimensional environments requires a large amount of data, which is difficult to obtain. To overcome this difficulty, this paper proposes to combine model-based reinforcement learning with the recently developed least-squares conditional entropy (LSCE) method, which simultaneously performs transition model estimation and dimension reduction. We further extend the proposed method to imitation learning scenarios. The experimental results show that policy search combined with LSCE performs well for high-dimensional control tasks including real humanoid robot control. (C) 2016 Elsevier Ltd. All rights reserved.
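For readers who want a concrete picture of the idea summarized in the abstract, the sketch below shows a transition model fitted on a low-dimensional projection of the state and then used for prediction. It is a minimal, hypothetical example: a PCA projection and a linear-Gaussian model fitted by ordinary least squares stand in for the paper's LSCE estimator (which learns the projection and the conditional density jointly), and all function and variable names are illustrative rather than taken from the paper.

```python
import numpy as np

def fit_low_dim_transition_model(S, A, S_next, k):
    """Fit a transition model on a k-dimensional projection of the state.

    Illustrative stand-in for joint dimension reduction + model estimation:
    the projection is PCA and the conditional model is linear-Gaussian,
    whereas the paper's LSCE estimates both components jointly.
    S: (n, d) states, A: (n, m) actions, S_next: (n, d) next states.
    """
    mean = S.mean(axis=0)
    S_centered = S - mean
    # PCA projection matrix B (d x k): top-k right singular vectors.
    _, _, Vt = np.linalg.svd(S_centered, full_matrices=False)
    B = Vt[:k].T
    Z = S_centered @ B                      # low-dimensional states (n x k)

    # Linear-Gaussian model of the next state given [z, a, 1].
    X = np.hstack([Z, A, np.ones((len(Z), 1))])
    W, *_ = np.linalg.lstsq(X, S_next, rcond=None)
    Sigma = np.cov((S_next - X @ W).T)      # residual covariance
    return B, W, Sigma, mean

def predict_next_state(s, a, B, W, mean):
    """Mean next-state prediction of the learned model."""
    z = (s - mean) @ B
    x = np.concatenate([z, a, [1.0]])
    return x @ W
```

A policy-search procedure would then score candidate policies by rolling out imagined trajectories with predict_next_state and the task's reward function, rather than collecting new real-environment data for every candidate.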
