On-Line Parameter Tuning for Monte-Carlo Tree Search in General Game Playing

4Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Many enhancements have been proposed for Monte-Carlo Tree Search (MCTS). Some of them have been applied successfully in the context of General Game Playing (GGP). MCTS and its enhancements are usually controlled by multiple parameters that require extensive and time-consuming computation to be tuned in advance. Moreover, in GGP optimal parameter values may vary depending on the considered game. This paper proposes a method to automatically tune search-control parameters on-line for GGP. This method considers the tuning problem as a Combinatorial Multi-Armed Bandit (CMAB). Four strategies designed to deal with CMABs are evaluated for this particular problem. Experiments show that on-line tuning in GGP almost reaches the same performance as off-line tuning. It can be considered as a valid alternative for domains where off-line parameter tuning is costly or infeasible.

Cite

CITATION STYLE

APA

Sironi, C. F., & Winands, M. H. M. (2018). On-Line Parameter Tuning for Monte-Carlo Tree Search in General Game Playing. In Communications in Computer and Information Science (Vol. 818, pp. 75–95). Springer Verlag. https://doi.org/10.1007/978-3-319-75931-9_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free