Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning

Freek Stulp; Pierre Yves Oudeyer

Journal ArticleOPEN ACCESS

Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning

Paladyn (2012) 3(3) 128-135

DOI: 10.2478/s13230-013-0108-6

5Citations

22Readers

Abstract

The "Policy Improvement with Path Integrals"(PI2) [25] and "Covariance Matrix Adaptation-Evolutionary Strategy"[8] are considered to be state-of-the-art in direct reinforcement learning and stochastic optimization respectively. We have recently shown that incorporating covariance matrix adaptation into PI2-which yields the PICMA2 algorithm-enables adaptive exploration by continually and autonomously reconsidering the exploration/exploitation trade-off. In this article, we provide an overview of our recent work on covariance matrix adaptation for direct reinforcement learning [22-24], highlight its relevance to developmental robotics, and conduct further experiments to analyze the results. We investigate two complementary phenomena from developmental robotics. First, we demonstrate PICMA2's ability to adapt to slowly or abruptly changing tasks due to its continual and adaptive exploration. This is an important component of life-long skill learning in dynamic environments. Second, we show on a reaching task PICMA2 how subsequently releases degrees of freedom from proximal to more distal limbs as learning progresses. A similar effect is observed in human development, where it is known as 'proximodistal maturation'.

Author supplied keywords

Cite

CITATION STYLE

APA

Stulp, F., & Oudeyer, P. Y. (2012). Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning. Paladyn, 3(3), 128–135. https://doi.org/10.2478/s13230-013-0108-6

Adaptive Exploration through Covariance Matrix Adaptation Enables Developmental Motor Learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions