The conventional reinforcement learning approaches have difficulties to handle the policy alternation of the opponents because it may cause dynamic changes of state transition probabilities of which stability is necessary for the learning to converge. This paper presents a method of multi-module reinforcement learning in a multiagent environment, by which the learning agent can adapt itself to the policy changes of the opponents. We show a preliminary result of a simple soccer situation in the context of RoboCup. © Springer-Verlag Berlin Heidelberg 2003.
CITATION STYLE
Takahashi, Y., Edazawa, K., & Asada, M. (2003). Behavior acquisition based on multi-module learning system in multi-agent environment. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2752, pp. 435–442). Springer Verlag. https://doi.org/10.1007/978-3-540-45135-8_39
Mendeley helps you to discover research relevant for your work.