☆ 3.8 Proceedings Paper

Learning a Behavioral Repertoire from Demonstrations

2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020) (2020)

Journal

2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020)

Volume -, Issue -, Pages 383-390

Publisher

IEEE

Keywords

StarCraft II; imitation learning; build-order planning; online adaptation

Funding

Elite Research travel grant from The Danish Ministry for Higher Education and Science
Universidad Nacional de Colombia
European Research Council [637972]
Lifelong Learning Machines program from DARPA/MTO [FA8750-18-C-0103]
European Research Council (ERC) [637972] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Imitation Learning (IL) is a machine learning approach to learn a policy from a set of demonstrations. IL can be useful to kick-start learning before applying reinforcement learning (RL) but it can also be useful on its own, e.g. to learn to imitate human players in video games. Despite the success of systems that use IL and RL, how such systems can adapt in-between game rounds is a neglected area of study but an important aspect of many strategy games. In this paper, we present a new approach called Behavioral Repertoire Imitation Learning (BRIL) that learns a repertoire of behaviors from a set of demonstrations by augmenting the state-action pairs with behavioral descriptions. The outcome of this approach is a single neural network policy conditioned on a behavior description that can be precisely modulated. We apply this approach to train a policy on 7,777 human demonstrations for the build-order planning task in StarCraft II. Dimensionality reduction is applied to construct a low-dimensional behavioral space from a high-dimensional description of the army unit composition of each human replay. The results demonstrate that the learned policy can be effectively manipulated to express distinct behaviors. Additionally, by applying the UCB1 algorithm, the policy can adapt its behavior - in-between games - to reach a performance beyond that of the traditional IL baseline approach.

Learning a Behavioral Repertoire from Demonstrations

Journal

2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020)

Publisher

IEEE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Learning a Behavioral Repertoire from Demonstrations

Journal

2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020)

Publisher

IEEE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper