4.6 Article

Multi-source transfer ELM-based Q learning

Journal

NEUROCOMPUTING
Volume 137, Pages 57-64

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2013.04.045

Keywords

Q learning; Extreme learning machine; Continuous space; Multi-source transfer; Boat problem

Funding

  1. National Natural Science Foundation of China [61273143]
  2. Specialized Research Fund for the Doctoral Program of Higher Education of China [20120095110025]
  3. Fundamental Research Project of Central Universities [2012LWB70]
  4. College Graduate Research and Innovation Projects of Jiangsu Province [CXZZ12_0932]

Abstract

Extreme learning machine (ELM) offers good generalization, a simple structure, and convenient computation. The first contribution of this paper is therefore an ELM-based Q learning that uses an ELM as the Q-value function approximator, making it suitable for large-scale or continuous-space problems. Because the number of ELM hidden-layer nodes equals the number of training samples, a large sample size seriously slows learning, so a rolling time-window mechanism is introduced into the ELM-based Q learning to limit the size of the ELM's training set. In addition, to reduce the difficulty of learning new tasks, transfer learning is introduced into the ELM-based Q learning, allowing past experience and knowledge to be reused on the current task. The second contribution is thus a multi-source transfer ELM-based Q learning (MST-ELMQ), which takes full advantage of valuable information from multiple source tasks while avoiding the negative transfer that results from irrelevant information. Based on Bayesian theory, each source task is assigned a task transfer weight and each source sample a sample transfer weight; together, these weights determine how many samples are transferred and in what manner. Samples with large sample transfer weights are selected from each source task and help the Q-learning agent make decisions quickly on the target task. Simulation results on a boat problem show that MST-ELMQ outperforms Q-learning algorithms with no source task or with a single source task, i.e., it effectively reduces learning difficulty and finds an optimal solution with less training. (C) 2013 Elsevier B.V. All rights reserved.
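As a rough, non-authoritative sketch of the first contribution, the Python/NumPy code below approximates the Q-value function with an ELM whose random input weights stay fixed, solves the output weights by a least-squares (pseudo-inverse) fit, and caps the training set with a rolling time window. The class name ELMQ, the feature encoding (state features concatenated with a one-hot action), the tanh activation, and all hyperparameters are illustrative assumptions rather than the paper's exact formulation, and the multi-source transfer step is omitted.

import numpy as np
from collections import deque

class ELMQ:
    """ELM Q-value approximator trained on a rolling time window of samples (illustrative sketch)."""

    def __init__(self, state_dim, n_actions, n_hidden=50, window=200, gamma=0.95, seed=0):
        rng = np.random.default_rng(seed)
        in_dim = state_dim + n_actions                 # input = state features + one-hot action
        self.W = rng.normal(size=(in_dim, n_hidden))   # random input weights (never trained)
        self.b = rng.normal(size=n_hidden)             # random hidden biases (never trained)
        self.beta = np.zeros(n_hidden)                 # output weights (solved by least squares)
        self.n_actions = n_actions
        self.gamma = gamma
        self.window = deque(maxlen=window)             # rolling time window bounds the sample size

    def _hidden(self, s, a):
        x = np.concatenate([np.asarray(s, dtype=float), np.eye(self.n_actions)[a]])
        return np.tanh(x @ self.W + self.b)            # hidden-layer activations

    def q(self, s, a):
        return float(self._hidden(s, a) @ self.beta)

    def update(self, s, a, r, s_next, done):
        # One-step Q-learning target, bootstrapped from the current approximator.
        target = r if done else r + self.gamma * max(self.q(s_next, b) for b in range(self.n_actions))
        self.window.append((self._hidden(s, a), target))
        H = np.stack([h for h, _ in self.window])      # hidden-layer output matrix over the window
        T = np.array([t for _, t in self.window])
        self.beta = np.linalg.pinv(H) @ T              # Moore-Penrose least-squares solution

In the full MST-ELMQ algorithm, samples selected from the source tasks according to their task and sample transfer weights would additionally be added to this training window before the output weights are re-solved; the sketch above shows only the single-task ELM-based Q-learning core.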
