The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since other agent behaviors may cause sudden changes of state transition probabilities of which constancy is necessary for the learning to converge. A modular learning approach would be able to solve this problem if a learning agent can assign each module to one situation in which the module can regard the state transition probabilities as constant. This paper presents a method of modular learning in a multiagent environment, by which the learning agent can adapt its behaviors to the situations as results of the other agent's behaviors. Scheduling for learning is introduced to avoid the complexity in autonomous situation assignment. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Takahashi, Y., Edazawa, K., & Asada, M. (2005). Modular learning system and scheduling for behavior acquisition in multi-agent environment. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3276, pp. 548–555). Springer Verlag. https://doi.org/10.1007/978-3-540-32256-6_51
Mendeley helps you to discover research relevant for your work.