The Application of AlphaZero to Wargaming

Glennn Moy; Slava Shekh

Conference Proceedings

The Application of AlphaZero to Wargaming

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11919 LNAI 3-14

DOI: 10.1007/978-3-030-35288-2_1

9Citations

12Readers

Get full text

Abstract

In this paper, we explore the process of automatically learning to play wargames using AlphaZero deep reinforcement learning. We consider a simple wargame, Coral Sea, which is a turn-based game played on a hexagonal grid between two players. We explore the differences between Coral Sea and traditional board games, where the successful use of AlphaZero has been demonstrated. Key differences include: problem representation, wargame asymmetry, limited strategic depth, and the requirement for significant hardware resources. We demonstrate how bootstrapping AlphaZero with supervised learning can overcome these challenges. In the context of Coral Sea, this enables AlphaZero to learn optimal play and outperform the supervised examples on which it was trained.

Author supplied keywords

Cite

CITATION STYLE

APA

Moy, G., & Shekh, S. (2019). The Application of AlphaZero to Wargaming. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11919 LNAI, pp. 3–14). Springer. https://doi.org/10.1007/978-3-030-35288-2_1

The Application of AlphaZero to Wargaming

Abstract

Author supplied keywords

Cite

Register to see more suggestions