Monte-Carlo tree search enhancements for Havannah

Jan A. Stankiewicz; Mark H.M. Winands; Jos W.H.M. Uiterwijk

Conference Proceedings

Monte-Carlo tree search enhancements for Havannah

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7168 LNCS 60-71

DOI: 10.1007/978-3-642-31866-5_6

7Citations

7Readers

Get full text

Abstract

This article shows how the performance of a Monte-Carlo Tree Search (MCTS) player for Havannah can be improved by guiding the search in the playout and selection steps of MCTS. To improve the playout step of the MCTS algorithm, we used two techniques to direct the simulations, Last-Good-Reply (LGR) and N-grams. Experiments reveal that LGR gives a significant improvement, although it depends on which LGR variant is used. Using N-grams to guide the playouts also achieves a significant increase in the winning percentage. Combining N-grams with LGR leads to a small additional improvement. To enhance the selection step of the MCTS algorithm, we initialize the visit and win counts of the new nodes based on pattern knowledge. By biasing the selection towards joint/neighbor moves, local connections, and edge/corner connections, a significant improvement in the performance is obtained. Experiments show that the best overall performance is obtained when combining the visit-and-win-count initialization with LGR and N-grams. In the best case, a winning percentage of 77.5% can be achieved against the default MCTS program. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Stankiewicz, J. A., Winands, M. H. M., & Uiterwijk, J. W. H. M. (2012). Monte-Carlo tree search enhancements for Havannah. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7168 LNCS, pp. 60–71). https://doi.org/10.1007/978-3-642-31866-5_6

Monte-Carlo tree search enhancements for Havannah

Abstract

Cite

Register to see more suggestions