4.0 Article

BALANCING MCTS BY DYNAMICALLY ADJUSTING THE KOMI VALUE

Journal

ICGA JOURNAL
Volume 34, Issue 3, Pages 131-139

Publisher

UNIV MAASTRICHT FACULTY GENERAL SCIENCES
DOI: 10.3233/ICG-2011-34302

Keywords

-

Funding

  1. GAUK of Charles University Prague [66010]

Ask authors/readers for more resources

Monte-Carlo Tree Search tends to produce unstable and unreasonable results in the game of Go when used in positions with an extreme advantage or disadvantage. This is due to a poor move selection because of the low signal-to-noise ratio. Notably, it frequently occurs, when playing in a high handicap game. The handicap advantage is in some sense a disadvantage for the computer when playing against a strong human opponent. We explore and compare multiple approaches to mitigate this problem by artificially evening out the game by modifying the final game score by a variable amount of points (dynamic komi) before noting the result in the game tree. Moreover. Nye compare the performance of MCTS and the traditional tree search in the context of extreme positions and measure the effect of the dynamic komi on the actual playing strength of a state-of-art MCTS Go program. Based on our results, we formulate a conjecture on the resilience of the game search tree to changes in the evaluation function throughout the search.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.0
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available