Though limited in real-world decision making, most multi-agent reinforcement learning (MARL) models assume perfectly rational agents - a property hardly met due to individuals' cognitive limitation and/or the tractability of the decision problem. In this paper, we introduce generalized recursive reasoning (GR2) as a novel framework to model agents with different hierarchical levels of rationality; our framework enables agents to exhibit varying levels of “thinking” ability thereby allowing higher-level agents to best respond to various less sophisticated learners. We contribute both theoretically and empirically. On the theory side, we devise the hierarchical framework of GR2 through probabilistic graphical models and prove the existence of a perfect Bayesian equilibrium. Within the GR2, we propose a practical actor-critic solver, and demonstrate its convergent property to a stationary point in two-player games through Lyapunov analysis. On the empirical side, we validate our findings on a variety of MARL benchmarks. Precisely, we first illustrate the hierarchical thinking process on the Keynes Beauty Contest, and then demonstrate significant improvements compared to state-of-the-art opponent modeling baselines on the normal-form games and the cooperative navigation benchmark.
CITATION STYLE
Wen, Y., Yang, Y., & Wang, J. (2020). Modelling bounded rationality in multi-agent interactions by generalized recursive reasoning. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2021-January, pp. 414–421). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2020/58
Mendeley helps you to discover research relevant for your work.