Deltadou: Expert-level doudizhu AI through self-play

47Citations
Citations of this article
42Readers
Mendeley users who have this article in their library.

Abstract

Artificial Intelligence has seen several breakthroughs in two-player perfect information game. Nevertheless, Doudizhu, a three-player imperfect information game, is still quite challenging. In this paper, we present a Doudizhu AI by applying deep reinforcement learning from games of self-play. The algorithm combines an asymmetric MCTS on nodes representing each player's information set, a policy-value network that approximates the policy and value on each decision node, and inference on unobserved hands of other players by given policy. Our results show that self-play can significantly improve the performance of our agent in this multiagent imperfect information game. Even starting with a weak AI, our agent can achieve human expert level after days of self-play and training.

Cite

CITATION STYLE

APA

Jiang, Q., Li, K., Du, B., Chen, H., & Fang, H. (2019). Deltadou: Expert-level doudizhu AI through self-play. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2019-August, pp. 1265–1271). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/176

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free