Purposive behavior acquisition for a real robot by vision-based reinforcement learning

Abstract

This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal. We discuss several issues in applying the reinforcement learning method to a real robot with a vision sensor, by which the robot can obtain information about changes in its environment. First, we construct a state space in terms of the size, position, and orientation of a ball and a goal in the image, and an action space in terms of the action commands sent to the left and right motors of a mobile robot. Constructing state and action spaces that directly reflect the outputs of physical sensors and actuators causes a "state-action deviation" problem: a single motor command may be too small to change the observed state. To deal with this issue, the action set is constructed so that one action consists of the same action primitive executed repeatedly until the current state changes. Next, to speed up learning, a mechanism of Learning from Easy Missions (LEM) is implemented. LEM reduces the learning time from exponential to almost linear order in the size of the state space. The results of computer simulations and real robot experiments are given. © 1996 Kluwer Academic Publishers.
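The two ideas in the abstract can be illustrated with a toy sketch. This is an assumption-laden illustration, not the paper's implementation: the environment, bucket width, reward, and LEM schedule below are all invented for a 1-D chain, whereas the paper works with images and motor commands. It shows (1) a macro action that repeats one motor primitive until the coarse, discretized state changes (the fix for the state-action deviation problem) and (2) an LEM schedule that starts early episodes near the goal and gradually moves the start state farther away.

```python
import random

# Toy sketch (assumptions, not the paper's code): the robot's position is
# continuous, but the learner only sees a coarse discretized state.  A single
# motor primitive may be too small to change the coarse state ("state-action
# deviation"), so one *action* repeats the primitive until the state changes.

BUCKET = 1.0               # width of one coarse state (assumed)
N_STATES = 10              # coarse states 0..9
GOAL = 0                   # coarse state 0 is the goal (assumed)
PRIMITIVES = [-0.3, +0.3]  # motor steps finer than BUCKET

def coarse(pos):
    """Discretize a continuous position into a coarse state index."""
    return int(min(max(pos, 0.0), N_STATES * BUCKET - 1e-9) // BUCKET)

def macro_step(pos, prim):
    """Repeat the same primitive until the coarse state changes or we hit a wall."""
    s0 = coarse(pos)
    while coarse(pos) == s0:
        new = min(max(pos + prim, 0.0), N_STATES * BUCKET)
        if new == pos:     # stuck at a boundary; give up on changing state
            break
        pos = new
    return pos

def q_learning_with_lem(episodes=3000, alpha=0.5, gamma=0.9, eps=0.1):
    """Tabular Q-learning over macro actions, with an LEM start-state schedule."""
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for ep in range(episodes):
        # LEM schedule: early episodes start close to the goal; later
        # episodes may start farther away, so easy missions come first.
        max_start = 1 + (N_STATES - 2) * ep // episodes
        pos = (random.randint(1, max_start) + 0.5) * BUCKET
        for _ in range(100):
            s = coarse(pos)
            if s == GOAL:
                break
            # epsilon-greedy choice between the two primitives
            a = random.randrange(2) if random.random() < eps else \
                max(range(2), key=lambda i: Q[s][i])
            pos = macro_step(pos, PRIMITIVES[a])
            s2 = coarse(pos)
            r = 1.0 if s2 == GOAL else 0.0
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
    return Q
```

Because each macro action is guaranteed to change the coarse state, the learner's state transitions stay consistent with its discretization, which is the point of the paper's action-set construction; the LEM schedule only changes where episodes begin, not the update rule itself.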


APA

Asada, M., Noda, S., Tawaratsumida, S., & Hosoda, K. (1996). Purposive behavior acquisition for a real robot by vision-based reinforcement learning. Machine Learning, 23(2–3), 279–303. https://doi.org/10.1007/bf00117447
