Double-oracle sampling method for stackelberg equilibrium approximation in general-sum extensive-form games

16Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

The paper presents a new method for approximating Strong Stackelberg Equilibrium in general-sum sequential games with imperfect information and perfect recall. The proposed approach is generic as it does not rely on any specific properties of a particular game model. The method is based on iterative interleaving of the two following phases: (1) guided Monte Carlo Tree Search sampling of the Follower's strategy space and (2) building the Leader's behavior strategy tree for which the sampled Follower's strategy is an optimal response. The above solution scheme is evaluated with respect to expected Leader's utility and time requirements on three sets of interception games with variable characteristics, played on graphs. A comparison with three state-of-the-art MILP/LPbased methods shows that in vast majority of test cases proposed simulation-based approach leads to optimal Leader's strategies, while excelling the competitive methods in terms of better time scalability and lower memory requirements.

Cite

CITATION STYLE

APA

Karwowski, J., & Mańdziuk, J. (2020). Double-oracle sampling method for stackelberg equilibrium approximation in general-sum extensive-form games. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 2054–2061). AAAI press. https://doi.org/10.1609/aaai.v34i02.5578

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free