Forward and bidirectional planning based on reinforcement learning and neural networks in a simulated robot.

Gianluca Baldassarre

Conference Proceedings

Forward and bidirectional planning based on reinforcement learning and neural networks in a simulated robot.

Baldassarre G

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2003) 2684 179-200

DOI: 10.1007/978-3-540-45002-3_11

20Citations

14Readers

Get full text

Abstract

Building intelligent systems that are capable of learning, acting reac-tively and planning actions before their execution is a major goal of artificial intelligence. This paper presents two reactive and planning systems that contain important novelties with respect to previous neural-network planners and rein-forcement-learning based planners: (a) the introduction of a new component (.matcher.) allows both planners to execute genuine taskable planning (while previous reinforcement-learning based models have used planning only for speeding up learning); (b) the planners show for the first time that trained neu-ral-network models of the world can generate long prediction chains that have an interesting robustness with regards to noise; (c) two novel algorithms that generate chains of predictions in order to plan, and control the flows of infor-mation between the systems. different neural components, are presented; (d) one of the planners uses backward .predictions. to exploit the knowledge of the pursued goal; (e) the two systems presented nicely integrate reactive behav-ior and planning on the basis of a measure of .confidence. in action. The soundness and potentialities of the two reactive and planning systems are tested and compared with a simulated robot engaged in a stochastic path-finding task. The paper also presents an extensive literature review on the relevant issues.

Cite

CITATION STYLE

APA

Baldassarre, G. (2003). Forward and bidirectional planning based on reinforcement learning and neural networks in a simulated robot. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2684, pp. 179–200). Springer Verlag. https://doi.org/10.1007/978-3-540-45002-3_11

Forward and bidirectional planning based on reinforcement learning and neural networks in a simulated robot.

Abstract

Cite

Register to see more suggestions