Article

Concurrent Q-learning: Reinforcement learning for dynamic goals and environments

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 20, Issue 10, Pages 1037-1052

Publisher

JOHN WILEY & SONS INC
DOI: 10.1002/int.20105

Keywords

-


Abstract

This article presents a powerful new algorithm for reinforcement learning in problems where the goals and also the environment may change. The algorithm is completely goal independent, allowing the mechanics of the environment to be learned independently of the task that is being undertaken. Conventional reinforcement learning techniques, such as Q-learning, are goal dependent: when the goal or reward conditions change, previous learning interferes with the new task being learned, resulting in very poor performance. Previously, the Concurrent Q-learning algorithm was developed, based on Watkins' Q-learning, which learns the relative proximity of all states simultaneously. This learning is completely independent of the reward experienced at those states and, through a simple action selection strategy, may be applied to any given reward structure. Here it is shown that the extra information obtained may be used to replace the eligibility traces of Watkins' Q-learning, allowing many more value updates to be made at each time step. The new algorithm is compared to the previous version and also to DG-learning in tasks involving changing goals and environments. The new algorithm is shown to perform significantly better than these alternatives, especially in situations involving novel obstructions. The algorithm adapts quickly and intelligently to changes in both the environment and reward structure, and does not suffer interference from training undertaken prior to those changes. (c) 2005 Wiley Periodicals, Inc.
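The abstract describes the key idea, learning the relative proximity of all states concurrently and independently of reward, but does not give the update rule. Below is a minimal, hypothetical Python sketch of that general idea only: a tabular learner that treats every visited state as a potential goal and updates proximity estimates for all goals on each real transition, so that switching the active goal changes action selection but not the learned table. The class name, parameters, and 0/1 pseudo-reward are illustrative assumptions, not the authors' formulation.

```python
import random
from collections import defaultdict

class ConcurrentGoalLearner:
    """Sketch of goal-independent, all-goals-at-once tabular learning.

    q[(s, a, g)] estimates the discounted proximity of goal state g when
    taking action a in state s. Every observed transition updates the
    estimate for every known goal, not just the one currently rewarded.
    """

    def __init__(self, actions, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)      # keys: (state, action, goal)
        self.known_states = set()

    def update(self, s, a, s_next):
        """Goal-independent update applied once per real transition."""
        self.known_states.update((s, s_next))
        for g in self.known_states:
            if s_next == g:
                # Reaching g is worth 1 and ends the virtual episode for g.
                target = 1.0
            else:
                best_next = max(self.q[(s_next, b, g)] for b in self.actions)
                target = self.gamma * best_next
            self.q[(s, a, g)] += self.alpha * (target - self.q[(s, a, g)])

    def act(self, s, goal):
        """Epsilon-greedy selection toward whichever goal is rewarded now."""
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(s, a, goal)])
```

Because the table is indexed by goal, a change in the reward structure only changes which slice of the table drives action selection; the learned model of the environment is untouched, which is the goal independence the abstract emphasizes.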

