Random sampling of states in dynamic programming

Christopher G. Atkeson; Benjamin J. Stephens

Journal Article

Random sampling of states in dynamic programming

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics (2008) 38(4) 924-929

DOI: 10.1109/TSMCB.2008.926610

40Citations

132Readers

Get full text

Abstract

We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using local trajectory optimizers to globally optimize a policy and associated value function. Our focus is on finding steady-state policies for deterministic time-invariant discrete time control problems with continuous states and actions often found in robotics. In this paper, we describe our approach and provide initial results on several simulated robotics problems. © 2008 IEEE.

Author supplied keywords

Cite

CITATION STYLE

APA

Atkeson, C. G., & Stephens, B. J. (2008). Random sampling of states in dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 38(4), 924–929. https://doi.org/10.1109/TSMCB.2008.926610

Random sampling of states in dynamic programming

Abstract

Author supplied keywords

Cite

Register to see more suggestions