Article

Evolving action pre-selection parameters for MCTS in real-time strategy games

Journal

ENTERTAINMENT COMPUTING
Volume 42, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.entcom.2022.100493

Keywords

Game AI; Real-Time Strategy; MCTS; μRTS; Genetic Algorithms; Parameter Optimization

Funding

  1. Ministerio Espanol de Economia y Competitividad [P18-RT-4830]
  2. Junta de Andalucia [A-TIC-608-UGR20, B-TIC-402-UGR18]
  3. FEDER and Junta de Andalucia [PID2020-113462RB-I00]

Real-Time Strategy games pose challenges for search and machine learning due to their large combinatorial decision and state spaces. Exploiting domain knowledge can assist in navigating these spaces and improve game-playing agents' performance.
Real-Time Strategy (RTS) games are well-known for their substantially large combinatorial decision and state spaces, responsible for creating significant challenges for search and machine learning techniques. Exploiting domain knowledge to assist in navigating the expansive decision and state spaces could facilitate the emergence of competitive RTS game-playing agents. Usually, domain knowledge can take the form of expert traces or expert-authored scripts. A script encodes a strategy conceived by a human expert and can be used to steer a search algorithm, such as Monte Carlo Tree Search (MCTS), towards high-value states. However, a script is coarse by nature, meaning that it could be subject to exploitation and poor low-level tactical performance. We propose to perceive scripts as a collection of heuristics that can be parameterized and combined to form a wide array of strategies. The parameterized heuristics mold and filter the decision space in favor of a strategy expressed in terms of parameters. The proposed agent, ParaMCTS, implements several common heuristics and uses NaiveMCTS to search the downsized decision space; however, it requires a preceding manual parameterization step. A genetic algorithm is proposed for use in an optimization phase that aims to replace manual tuning and find an optimal set of parameters for use by EvoPMCTS, the evolutionary counterpart of ParaMCTS. Experimental results using the μRTS testbed show that EvoPMCTS outperforms several state-of-the-art agents across multiple maps of distinct layouts.
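
The two ideas in the abstract, parameterized pre-selection of actions and evolutionary tuning of those parameters, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the action categories, the threshold rule in `preselect`, the `toy_fitness` function (a stand-in for match results in the testbed), and all genetic-algorithm settings are assumptions chosen only to keep the example small and self-contained.

```python
import random

# Hypothetical action categories; the paper's actual heuristics are not listed here.
ACTION_TYPES = ("harvest", "attack", "build", "move")

def preselect(actions, params):
    """Keep (type, score) actions whose heuristic score meets the per-type threshold."""
    kept = [a for a in actions if a[1] >= params[ACTION_TYPES.index(a[0])]]
    return kept or list(actions)  # fall back so the search always has something to expand

def toy_fitness(params):
    """Assumed stand-in for playing matches: closeness to a hidden target vector."""
    target = (0.2, 0.7, 0.5, 0.9)
    return -sum((p - t) ** 2 for p, t in zip(params, target))

def evolve(fitness, n_params, pop_size=20, generations=40, mut_sigma=0.1, seed=0):
    """Simple elitist genetic algorithm over parameter vectors in [0, 1]^n_params."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_params)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[: pop_size // 4]  # the best quarter survives unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            # Gaussian mutation, clamped back into [0, 1]
            child = [min(1.0, max(0.0, g + rng.gauss(0, mut_sigma))) for g in child]
            children.append(child)
        pop = elite + children
    return max(pop, key=fitness)

if __name__ == "__main__":
    best = evolve(toy_fitness, n_params=len(ACTION_TYPES))
    candidates = [("attack", 0.9), ("move", 0.1), ("harvest", 0.5)]
    print("evolved thresholds:", [round(g, 2) for g in best])
    print("pre-selected actions:", preselect(candidates, best))
```

In a real agent the filtered action list would be handed to the tree search, and the fitness of a parameter vector would come from game outcomes rather than from a closed-form function.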
