Article

Concurrent Q-learning: Reinforcement learning for dynamic goals and environments

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 20, Issue 10, Pages 1037-1052

Publisher

JOHN WILEY & SONS INC
DOI: 10.1002/int.20105

Abstract

This article presents a powerful new algorithm for reinforcement learning in problems where both the goals and the environment may change. The algorithm is completely goal independent, allowing the mechanics of the environment to be learned independently of the task that is being undertaken. Conventional reinforcement learning techniques, such as Q-learning, are goal dependent. When the goal or reward conditions change, previous learning interferes with the new task being learned, resulting in very poor performance. Previously, the Concurrent Q-Learning algorithm was developed; based on Watkins' Q-learning, it learns the relative proximity of all states simultaneously. This learning is completely independent of the reward experienced at those states and, through a simple action selection strategy, may be applied to any given reward structure. Here it is shown that the extra information obtained may be used to replace the eligibility traces of Watkins' Q-learning, allowing many more value updates to be made at each time step. The new algorithm is compared to the previous version and also to DG-learning in tasks involving changing goals and environments. The new algorithm is shown to perform significantly better than these alternatives, especially in situations involving novel obstructions. The algorithm adapts quickly and intelligently to changes in both the environment and reward structure, and does not suffer interference from training undertaken prior to those changes. (c) 2005 Wiley Periodicals, Inc.
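
The sketch below is a rough illustration of the goal-independent idea described in the abstract: a tabular estimate Q[g, s, a] of the proximity of state s to every possible goal state g is updated after each transition, and action selection for the currently rewarded goal simply reads off the relevant table. This is a minimal sketch under assumed conditions (a small discrete state space); the names n_states, n_actions, update_all_goals, and greedy_action and the learning constants are illustrative only and are not taken from the paper, which additionally uses the learned proximities in place of eligibility traces.

import numpy as np

# Minimal sketch of goal-independent, all-goals Q-learning in a tabular setting.
# Q[g, s, a] estimates the discounted proximity of state s to goal state g when
# taking action a; these estimates are learned without reference to any external reward.
n_states, n_actions = 25, 4          # illustrative sizes for a small grid world
alpha, gamma = 0.1, 0.95             # illustrative learning rate and discount factor

Q = np.zeros((n_states, n_states, n_actions))

def update_all_goals(s, a, s_next):
    """After one observed transition, update the proximity estimate for every goal."""
    for g in range(n_states):
        reached = (s_next == g)
        # Pseudo-reward of 1 only when the transition actually reaches goal g;
        # bootstrapping stops at the goal itself.
        target = 1.0 if reached else gamma * Q[g, s_next].max()
        Q[g, s, a] += alpha * (target - Q[g, s, a])

def greedy_action(s, current_goal, epsilon=0.1):
    """Epsilon-greedy action selection for whichever goal is currently rewarded."""
    if np.random.rand() < epsilon:
        return np.random.randint(n_actions)
    return int(np.argmax(Q[current_goal, s]))

If the rewarded goal changes, only the goal index used in greedy_action changes; the learned proximity tables are reused rather than relearned, which is the source of the robustness to changing goals and reward structures described in the abstract.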
