Evolutionary dynamics of Q-learning over the sequence form

Fabio Panozzo; Nicola Gatti; Marcello Restelli

Conference ProceedingsOPEN ACCESS

Evolutionary dynamics of Q-learning over the sequence form

Proceedings of the National Conference on Artificial Intelligence (2014) 3 2034-2040

DOI: 10.1609/aaai.v28i1.9012

12Citations

5Readers

Abstract

Multi-agent learning is a challenging open task in artificial intelligence. It is known an interesting connection between multi-agent learning algorithms and evolutionary game theory, showing that the learning dynamics of some algorithms can be modeled as replicator dynamics with a mutation term. Inspired by the recent sequence-form replicator dynamics, we develop a new version of the Q-learning algorithm working on the sequence form of an extensive-form game allowing thus an exponential reduction of the dynamics length w.r.t. those of the normal form. The dynamics of the proposed algorithm can be modeled by using the sequenceform replicator dynamics with a mutation term. We show that, although sequence-form and normal-form replicator dynamics are realization equivalent, the Q-learning algorithm applied to the two forms have nonrealization equivalent dynamics. Originally from the previous works on evolutionary game theory models form multi-agent learning, we produce an experimental evaluation to show the accuracy of the model.

Cite

CITATION STYLE

APA

Panozzo, F., Gatti, N., & Restelli, M. (2014). Evolutionary dynamics of Q-learning over the sequence form. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2034–2040). AI Access Foundation. https://doi.org/10.1609/aaai.v28i1.9012

Evolutionary dynamics of Q-learning over the sequence form

Abstract

Cite

Register to see more suggestions