Swarm reinforcement learning method based on an actor-critic method

Hitoshi Iima; Yasuaki Kuroe

Conference Proceedings

Swarm reinforcement learning method based on an actor-critic method

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6457 LNCS 279-288

DOI: 10.1007/978-3-642-17298-4_29

1Citations

3Readers

Get full text

Abstract

We recently proposed swarm reinforcement learning methods in which multiple agents are prepared and they learn not only by individual learning but also by learning through exchanging information among the agents. The methods have been applied to a problem in discrete state-action space so far, and Q-learning method has been used as the individual learning. Although many studies in reinforcement learning have been done for problems in the discrete state-action space, continuous state-action space is required for coping with most real-world tasks. This paper proposes a swarm reinforcement learning method based on an actor-critic method in order to acquire optimal policies rapidly for problems in the continuous state-action space. The proposed method is applied to an inverted pendulum control problem, and its performance is examined through numerical experiments. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Iima, H., & Kuroe, Y. (2010). Swarm reinforcement learning method based on an actor-critic method. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6457 LNCS, pp. 279–288). https://doi.org/10.1007/978-3-642-17298-4_29

Swarm reinforcement learning method based on an actor-critic method

Abstract

Cite

Register to see more suggestions