Reinforcement learning with heuristic to solve POMDP problem in mobile robot path planning

  • Adiprawita W
  • Ahmad A
  • Sembiring J
 et al. 
  • 7


    Mendeley users who have this article in their library.
  • 3


    Citations of this article.


In this paper we propose a method of presenting a special case of Value Function as a solution to POMDP in mobile robot navigation. By using this new method the Value Function complexity will be reduced and more intuitive. We also propose a new reinforcement learning method to solve the Value Function. This reinforcement learning is based on Bellman Equation augmented with A* like heuristic during update iteration. The result of this new Value Function is validated with This particle filter is simulaed in Matlab and also experimented physically using a simple autonomous mobile robot built with Lego Mindstorms NXT with 3 ultrasonic sonar and RWTH Mindstorms NXT Toolbox for Matlab to connect the robot to Matlab. This simulation and experiment also incorporate particle filter localization from previous research. The simulation and experiment show that the Value Function can be utilized very well.

Author-supplied keywords

  • LEGO Mindstorm NXR
  • RWTH toolbox
  • autonomous mobile robot
  • navigation
  • value function
  • value iteration

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Get full text


  • Widyawardana Adiprawita

  • Adang Suwandi Ahmad

  • Jaka Sembiring

  • Bambang R. Trilaksono

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free