Adaptive learning in continuous environment using actor-critic design and echo-state networks

Mohamed Oubbati; Johannes Uhlemann; Günther Palm

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7426 LNAI, pp. 320–329 (2012)

DOI: 10.1007/978-3-642-33093-3_32

Citations: 1

Readers: 10

Abstract

Approximating adaptive dynamic programming has been studied extensively in recent years for its potential scalability to solve problems involving continuous state and action spaces. The framework of adaptive critic design (ACD) addresses this issue and has been demonstrated in several case studies. The present paper proposes an implementation of ACD using an echo state network (ESN) as the critic. The ESN is trained online to estimate the utility function and adapt the control policy of an embodied agent. In addition to its simple training algorithm, the ESN structure facilitates backpropagation of the derivatives needed for adapting the controller. Experimental results using a mobile robot are provided to validate the proposed learning architecture. © 2012 Springer-Verlag.
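
The abstract gives no equations, so the following is only a minimal sketch of the general idea, not the authors' implementation. It shows an echo state network with a fixed random reservoir and a linear readout trained online on a temporal-difference error, together with the one-step derivative of the estimate with respect to the input, which is the kind of quantity an actor would need. The class name `EchoStateCritic`, the normalized LMS update, the row-sum rescaling of the reservoir, the constant bias input, and the toy task in `train_on_decay` are all assumptions introduced for illustration.

```python
import math
import random


class EchoStateCritic:
    """Small echo state network whose linear readout estimates a cost-to-go J."""

    def __init__(self, n_in, n_res=30, scale=0.8, density=0.2, seed=0):
        rng = random.Random(seed)
        self.n_in, self.n_res = n_in, n_res
        # Fixed random input weights (never trained).
        self.w_in = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                     for _ in range(n_res)]
        # Fixed sparse reservoir, rescaled so its largest absolute row sum
        # equals `scale` < 1 (an assumed, simple way to get fading memory).
        w = [[rng.uniform(-1.0, 1.0) if rng.random() < density else 0.0
              for _ in range(n_res)] for _ in range(n_res)]
        max_row = max(sum(abs(v) for v in row) for row in w) or 1.0
        self.w_res = [[v * scale / max_row for v in row] for row in w]
        self.w_out = [0.0] * n_res  # the only trained parameters
        self.x = [0.0] * n_res      # reservoir state

    def step(self, u):
        """Advance the reservoir with input u and return the new estimate J."""
        self.x = [math.tanh(sum(a * b for a, b in zip(self.w_in[i], u)) +
                            sum(a * b for a, b in zip(self.w_res[i], self.x)))
                  for i in range(self.n_res)]
        return self.value()

    def value(self):
        """Linear readout of the current reservoir state."""
        return sum(w * x for w, x in zip(self.w_out, self.x))

    def input_gradient(self):
        """dJ/du for the latest input (one-step derivative through tanh)."""
        return [sum(self.w_out[i] * (1.0 - self.x[i] ** 2) * self.w_in[i][k]
                    for i in range(self.n_res))
                for k in range(self.n_in)]

    def td_update(self, x_prev, j_prev, utility, j_next, gamma=0.9, lr=0.1):
        """Normalized LMS step on the temporal-difference error (assumed rule)."""
        err = utility + gamma * j_next - j_prev
        norm = 1e-8 + sum(v * v for v in x_prev)
        self.w_out = [w + lr * err * v / norm
                      for w, v in zip(self.w_out, x_prev)]
        return err


def train_on_decay(episodes=200, steps=20, gamma=0.9, seed=0):
    """Hypothetical toy task: s <- 0.9 * s with utility U = s**2."""
    critic = EchoStateCritic(n_in=2, seed=seed)
    mean_abs_err = []
    for _ in range(episodes):
        critic.x = [0.0] * critic.n_res
        s = 1.0
        j = critic.step([s, 1.0])          # second input is a constant bias
        total = 0.0
        for _ in range(steps):
            x_prev, j_prev, utility = list(critic.x), j, s * s
            s *= 0.9
            j_next = critic.step([s, 1.0])
            total += abs(critic.td_update(x_prev, j_prev, utility, j_next, gamma))
            j = critic.value()             # re-evaluate with the updated readout
        mean_abs_err.append(total / steps)
    return critic, mean_abs_err
```

Calling `critic, errors = train_on_decay()` returns the trained critic and the mean absolute temporal-difference error per episode. Only `w_out` changes during training, which is what keeps the update a single linear step. `input_gradient()` ignores the influence of older inputs stored in the reservoir, so it is a truncated derivative rather than a full backpropagation through time.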

Cite (APA)

Oubbati, M., Uhlemann, J., & Palm, G. (2012). Adaptive learning in continuous environment using actor-critic design and echo-state networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7426 LNAI, pp. 320–329). https://doi.org/10.1007/978-3-642-33093-3_32
