Playing hanabi near-optimally

Bruno Bouzy

Conference Proceedings

Playing hanabi near-optimally

Bouzy B

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10664 LNCS 51-62

DOI: 10.1007/978-3-319-71649-7_5

18Citations

7Readers

Get full text

Abstract

This paper describes a study on the game of Hanabi, a multi-player cooperative card game in which a player sees the cards of the other players but not his own cards. Previous work using the hat principle reached near-optimal results for 5 players and 4 cards per player: the perfect score was reached 75% of times on average. In the current work, we develop Hannibal, a set of players, aiming at obtaining near-optimal results as well. Our best players use the hat principle and a depth-one search algorithm. For 5 players and 4 cards per player, the perfect score was reached 92% of times on average. In addition, by relaxing a debatable rule of Hanabi, we generalized the near-optimal results to other numbers of players and cards per player: the perfect score was reached 90% of times on average. Furthermore, for 2 players, the hat principle is useless, and we used a confidence player obtaining high quality results as well. Overall, this study shows that the game of Hanabi can be played near-optimally by the computer.

Cite

CITATION STYLE

APA

Bouzy, B. (2017). Playing hanabi near-optimally. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10664 LNCS, pp. 51–62). Springer Verlag. https://doi.org/10.1007/978-3-319-71649-7_5

Playing hanabi near-optimally

Abstract

Cite

Register to see more suggestions