Abstract
We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using local trajectory optimizers to globally optimize a policy and associated value function. Our focus is on finding steady-state policies for deterministic time-invariant discrete time control problems with continuous states and actions often found in robotics. In this paper, we describe our approach and provide initial results on several simulated robotics problems. © 2008 IEEE.
Author supplied keywords
Cite
CITATION STYLE
Atkeson, C. G., & Stephens, B. J. (2008). Random sampling of states in dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 38(4), 924–929. https://doi.org/10.1109/TSMCB.2008.926610
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.