Approximating adaptive dynamic programming has been studied extensively in recent years for its potential scalability to solve problems involving continuous state and action spaces. The framework of adaptive critic design (ACD) addresses this issue and has been demonstrated in several case studies. The present paper proposes an implementation of ACD using an echo state network as the critic. The ESN is trained online to estimate the utility function and adapt the control policy of an embodied agent. In addition to its simple training algorithm, the ESN structure facilitates backpropagation of derivatives needed for adapting the controller. Experimental results using a mobile robot are provided to validate the proposed learning architecture. © 2012 Springer-Verlag.
CITATION STYLE
Oubbati, M., Uhlemann, J., & Palm, G. (2012). Adaptive learning in continuous environment using actor-critic design and echo-state networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7426 LNAI, pp. 320–329). https://doi.org/10.1007/978-3-642-33093-3_32
Mendeley helps you to discover research relevant for your work.