期刊
ENTERTAINMENT COMPUTING
卷 42, 期 -, 页码 -出版社
ELSEVIER SCI LTD
DOI: 10.1016/j.entcom.2022.100493
关键词
Game AI; Real-Time Strategy; MCTS; mu RTS; Genetic Algorithms; Parameter Optimization
类别
资金
- Ministerio Espanol de Economia y Competitividad [P18-RT-4830]
- Junta de Andalucia [A-TIC-608-UGR20, B-TIC-402-UGR18]
- Ministerio Espanol de Economia y Competitividad [P18-RT-4830]
- Junta de Andalucia [A-TIC-608-UGR20, B-TIC-402-UGR18]
- FEDER and Junta de Andalucia [PID2020-113462RB- I00]
Real-Time Strategy games pose challenges for search and machine learning due to their large combinatorial decision and state spaces. Exploiting domain knowledge can assist in navigating these spaces and improve game-playing agents' performance.
Real-Time Strategy (RTS) games are well-known for their substantially large combinatorial decision and state spaces, responsible for creating significant challenges for search and machine learning techniques. Exploiting domain knowledge to assist in navigating the expansive decision and state spaces could facilitate the emergence of competitive RTS game-playing agents. Usually, domain knowledge can take the form of expert traces or expert-authored scripts. A script encodes a strategy conceived by a human expert and can be used to steer a search algorithm, such as Monte Carlo Tree Search (MCTS), towards high-value states. However, a script is coarse by nature, meaning that it could be subject to exploitation and poor low-level tactical performance. We propose to perceive scripts as a collection of heuristics that can be parameterized and combined to form a wide array of strategies. The parameterized heuristics mold and filter the decision space in favor of a strategy expressed in terms of parameters. The proposed agent, ParaMCTS, implements several common heuristics and uses NaiveMCTS to search the downsized decision space; however, it requires a preceding manual parameterization step. A genetic algorithm is proposed for use in an optimization phase that aims to replace manual tuning and find an optimal set of parameters for use by EvoPMCTS, the evolutionary counterpart of ParaMCTS. Experimentation results using the mu RTS testbed show that EvoPMCTS outperforms several state-of-the-art agents across multiple maps of distinct layouts.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据