Swarm reinforcement learning method based on an actor-critic method

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We recently proposed swarm reinforcement learning methods in which multiple agents are prepared and they learn not only by individual learning but also by learning through exchanging information among the agents. The methods have been applied to a problem in discrete state-action space so far, and Q-learning method has been used as the individual learning. Although many studies in reinforcement learning have been done for problems in the discrete state-action space, continuous state-action space is required for coping with most real-world tasks. This paper proposes a swarm reinforcement learning method based on an actor-critic method in order to acquire optimal policies rapidly for problems in the continuous state-action space. The proposed method is applied to an inverted pendulum control problem, and its performance is examined through numerical experiments. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Iima, H., & Kuroe, Y. (2010). Swarm reinforcement learning method based on an actor-critic method. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6457 LNCS, pp. 279–288). https://doi.org/10.1007/978-3-642-17298-4_29

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free