Article

Multi-source transfer ELM-based Q learning

NEUROCOMPUTING (2014)

Journal

NEUROCOMPUTING

Volume 137, Issue -, Pages 57-64

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2013.04.045

Keywords

Q learning; Extreme learning machine; Continuous space; Multi-source transfer; Boat problem

Category

Computer Science, Artificial Intelligence

Funding

National Nature Science Foundation of China [61273143]
Specialized Research Fund for the Doctoral Program of Higher Education of China [20120095110025]
Fundamental Research Project of Central Universities [2012LWB70]
College Graduate Research and Innovation Projects of Jiangsu Province [CXZZ12_0932]

Abstract

The extreme learning machine (ELM) offers good generalization, a simple structure, and convenient computation. An ELM-based Q learning algorithm is therefore proposed, in which an ELM serves as the Q-value function approximator; this makes the algorithm suitable for large-scale or continuous-space problems. This is the first contribution of the paper. Because the number of ELM hidden-layer nodes equals the number of training samples, a large sample size seriously slows learning. A rolling time-window mechanism is therefore introduced into the ELM-based Q learning to reduce the number of training samples used by the ELM. In addition, to reduce the learning difficulty of new tasks, transfer learning, which reuses past experience and knowledge to solve current problems, is introduced into the ELM-based Q learning. The second contribution is thus a multi-source transfer ELM-based Q learning algorithm (MST-ELMQ), which can take full advantage of valuable information from multiple source tasks and avoid the negative transfer that results from irrelevant information. Following Bayesian theory, each source task is assigned a task transfer weight and each source sample is assigned a sample transfer weight. The task and sample transfer weights determine how many samples are transferred and how they are transferred. Samples with large sample transfer weights are selected from each source task and help the Q learning agent make quick decisions in the target task. Simulation results on a boat problem show that MST-ELMQ performs better than Q learning algorithms that use no source task or only a single source task: it effectively reduces the learning difficulty and finds an optimal solution with less training. © 2013 Elsevier B.V. All rights reserved.
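The first contribution described in the abstract (an ELM as Q-value approximator, trained on a rolling time window) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `RollingWindowELM`, the helper `q_target`, the sigmoid activation, the uniform weight initialization, the window length, and the discount factor are all assumptions made for illustration. The Bayesian task and sample transfer weights are left out because the abstract does not give their formulas.

```python
from collections import deque

import numpy as np


class RollingWindowELM:
    """Hypothetical ELM regressor fitted only on the most recent samples."""

    def __init__(self, n_inputs, window=50, seed=0):
        self.rng = np.random.default_rng(seed)
        self.n_inputs = n_inputs
        # deque with maxlen drops the oldest sample automatically,
        # which plays the role of the rolling time window.
        self.samples = deque(maxlen=window)
        self.W = self.b = self.beta = None

    def _hidden(self, X):
        # Sigmoid hidden layer (assumed activation).
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def add(self, x, target):
        self.samples.append((np.asarray(x, dtype=float), float(target)))

    def fit(self):
        X = np.array([s[0] for s in self.samples])
        t = np.array([s[1] for s in self.samples])
        # As stated in the abstract, hidden nodes == training samples.
        n_hidden = len(self.samples)
        # Random, untrained input weights and biases (standard ELM idea).
        self.W = self.rng.uniform(-1.0, 1.0, (self.n_inputs, n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, n_hidden)
        # Output weights from the Moore-Penrose pseudoinverse.
        self.beta = np.linalg.pinv(self._hidden(X)) @ t

    def predict(self, X):
        X = np.atleast_2d(np.asarray(X, dtype=float))
        if self.beta is None:
            # Untrained model: treat every Q value as zero.
            return np.zeros(len(X))
        return self._hidden(X) @ self.beta


def q_target(model, reward, next_state, actions, gamma=0.9):
    """One-step Q-learning target: r + gamma * max over a' of Q(s', a')."""
    next_inputs = np.array([np.append(next_state, a) for a in actions])
    return reward + gamma * np.max(model.predict(next_inputs))
```

In use, an agent would append each `(state, action)` input with its `q_target` value through `add` and call `fit` periodically; because the window caps the sample count, the pseudoinverse is taken over a matrix of at most `window` by `window` entries regardless of how long the agent has been running.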
