Many enhancements have been proposed for Monte-Carlo Tree Search (MCTS). Some of them have been applied successfully in the context of General Game Playing (GGP). MCTS and its enhancements are usually controlled by multiple parameters that require extensive and time-consuming computation to be tuned in advance. Moreover, in GGP optimal parameter values may vary depending on the considered game. This paper proposes a method to automatically tune search-control parameters on-line for GGP. This method considers the tuning problem as a Combinatorial Multi-Armed Bandit (CMAB). Four strategies designed to deal with CMABs are evaluated for this particular problem. Experiments show that on-line tuning in GGP almost reaches the same performance as off-line tuning. It can be considered as a valid alternative for domains where off-line parameter tuning is costly or infeasible.
CITATION STYLE
Sironi, C. F., & Winands, M. H. M. (2018). On-Line Parameter Tuning for Monte-Carlo Tree Search in General Game Playing. In Communications in Computer and Information Science (Vol. 818, pp. 75–95). Springer Verlag. https://doi.org/10.1007/978-3-319-75931-9_6
Mendeley helps you to discover research relevant for your work.