This paper discusses parameter-based exploration methods for reinforcement learning. Parameter-based methods perturb the parameters of a general function approximator directly, rather than adding noise to the resulting actions. Parameter-based exploration unifies reinforcement learning and black-box optimization, and has several advantages over action perturbation. We review two recent parameter-exploring algorithms: Natural Evolution Strategies and Policy Gradients with Parameter-Based Exploration. Both outperform state-of-the-art algorithms on several complex, high-dimensional tasks commonly found in robot control. Furthermore, we describe how a novel exploration method, State-Dependent Exploration, can modify existing algorithms to mimic exploration in parameter space.
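The distinction between the two exploration styles can be sketched with a toy linear policy. The shapes, noise scale, and episode length below are illustrative assumptions, not details from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative linear policy: action = theta @ state (shapes are assumptions).
theta = rng.normal(size=(2, 4))   # policy parameters
sigma = 0.1                       # exploration noise scale

def action_perturbation(theta, state):
    """Action perturbation: noise is added to the policy output at every
    step, so the same state can yield different actions within an episode."""
    return theta @ state + sigma * rng.normal(size=theta.shape[0])

def parameter_perturbation(theta):
    """Parameter-based exploration: perturb the parameters themselves; the
    resulting policy is then deterministic for the rest of the episode."""
    return theta + sigma * rng.normal(size=theta.shape)

# One episode with parameter-based exploration: sample perturbed parameters
# once, then act deterministically with them for the whole rollout.
theta_ep = parameter_perturbation(theta)
states = rng.normal(size=(5, 4))               # a short rollout of states
actions = np.array([theta_ep @ s for s in states])
```

Sampling the perturbation once per episode is what makes the perturbed policy's behavior consistent within a rollout, which is one of the advantages over per-step action noise discussed in the paper.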
Rückstieß, T., Sehnke, F., Schaul, T., Wierstra, D., Sun, Y., & Schmidhuber, J. (2010). Exploring Parameter Space in Reinforcement Learning. Paladyn, 1(1), 14–24. https://doi.org/10.2478/s13230-010-0002-4