AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms

Hang Xu; Kai Li; Haobo Fu; Qiang Fu; Junliang Xing

Conference Proceedings, Open Access

Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (2022), vol. 36, pp. 5244-5251

DOI: 10.1609/aaai.v36i5.20460

Abstract

Counterfactual regret minimization (CFR) is the most commonly used algorithm for approximately solving two-player zero-sum imperfect-information games (IIGs). In recent years, a series of novel CFR variants such as CFR+, Linear CFR, and DCFR have been proposed and have significantly improved the convergence rate of vanilla CFR. However, most of these new variants are hand-designed by researchers through trial and error based on different motivations, which generally requires a tremendous amount of effort and insight. This work proposes to meta-learn novel CFR algorithms through evolution to ease the burden of manual algorithm design. We first design a search language that is rich enough to represent many existing hand-designed CFR variants. We then exploit a scalable regularized evolution algorithm with a bag of acceleration techniques to efficiently search over the combinatorial space of algorithms defined by this language. The learned novel CFR algorithm can generalize to new IIGs not seen during training and performs on par with or better than existing state-of-the-art CFR variants. The code is available at https://github.com/rpSebastian/AutoCFR.
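
To make the search procedure mentioned in the abstract more concrete, the following is a minimal sketch of a generic regularized (aging) evolution loop. It is not the AutoCFR implementation: the candidate representation (a tuple of three numbers), the functions `toy_init`, `toy_mutate`, and `toy_fitness`, and all parameter values are hypothetical stand-ins chosen only so the example runs. In the paper's setting a candidate would instead be a program in the search language, and the score would come from evaluating that program on training games.

```python
import random
from collections import deque

def regularized_evolution(init, mutate, fitness, population_size=20,
                          tournament_size=5, cycles=200, seed=0):
    """Generic aging-evolution loop; returns the best (candidate, score) seen."""
    rng = random.Random(seed)
    population = deque()
    for _ in range(population_size):
        candidate = init(rng)
        population.append((candidate, fitness(candidate)))
    best = max(population, key=lambda entry: entry[1])
    for _ in range(cycles):
        # Tournament selection: the fittest member of a random sample is the parent.
        sample = rng.sample(list(population), tournament_size)
        parent = max(sample, key=lambda entry: entry[1])
        child = mutate(parent[0], rng)
        entry = (child, fitness(child))
        population.append(entry)
        # Regularization by age: drop the oldest member, not the worst one.
        population.popleft()
        if entry[1] > best[1]:
            best = entry
    return best

# Hypothetical toy problem: recover three target numbers by mutation alone.
TARGET = (1.5, 0.0, 2.0)

def toy_init(rng):
    return tuple(rng.uniform(-3.0, 3.0) for _ in range(3))

def toy_mutate(candidate, rng):
    index = rng.randrange(len(candidate))
    changed = list(candidate)
    changed[index] += rng.gauss(0.0, 0.3)  # perturb one coordinate
    return tuple(changed)

def toy_fitness(candidate):
    # Higher is better: negative squared distance to the target.
    return -sum((c - t) ** 2 for c, t in zip(candidate, TARGET))

if __name__ == "__main__":
    best_candidate, best_score = regularized_evolution(toy_init, toy_mutate, toy_fitness)
    print(best_candidate, best_score)
```

Removing the oldest member rather than the least fit one is what distinguishes this scheme from plain tournament evolution: a candidate survives only if its descendants keep being selected, which limits the influence of a single lucky evaluation.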

Cite

Xu, H., Li, K., Fu, H., Fu, Q., & Xing, J. (2022). AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 5244–5251). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i5.20460
