Artificial Intelligence has seen several breakthroughs in two-player perfect information game. Nevertheless, Doudizhu, a three-player imperfect information game, is still quite challenging. In this paper, we present a Doudizhu AI by applying deep reinforcement learning from games of self-play. The algorithm combines an asymmetric MCTS on nodes representing each player's information set, a policy-value network that approximates the policy and value on each decision node, and inference on unobserved hands of other players by given policy. Our results show that self-play can significantly improve the performance of our agent in this multiagent imperfect information game. Even starting with a weak AI, our agent can achieve human expert level after days of self-play and training.
CITATION STYLE
Jiang, Q., Li, K., Du, B., Chen, H., & Fang, H. (2019). Deltadou: Expert-level doudizhu AI through self-play. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2019-August, pp. 1265–1271). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/176
Mendeley helps you to discover research relevant for your work.