Inverse KKT – Learning Cost Functions of Manipulation Tasks from Demonstrations

6Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Inverse Optimal Control (IOC) assumes that demonstrations are the solution to an optimal control problem with unknown underlying costs, and extracts parameters of these underlying costs. We propose the framework of Inverse KKT, which assumes that the demonstrations fulfill the Karush–Kuhn–Tucker conditions of an unknown underlying constrained optimization problem, and extracts parameters of this underlying problem. Using this assumption, we can exploit the latter to extract the relevant task spaces and cost parameters from demonstrations of skills that involve contacts. For a typical linear parameterization of cost functions this reduces to a quadratic program, ensuring guaranteed and very efficient convergence, but we can deal also with arbitrary non-linear parameterizations of cost functions. The aim of our approach is to push learning from demonstration to more complex manipulation scenarios that include the interaction with objects and therefore the realization of contacts/constraints within the motion. We demonstrate the approach on tasks such as sliding a box and opening a door.

Cite

CITATION STYLE

APA

Englert, P., & Toussaint, M. (2018). Inverse KKT – Learning Cost Functions of Manipulation Tasks from Demonstrations. In Springer Proceedings in Advanced Robotics (Vol. 3, pp. 57–72). Springer Science and Business Media B.V. https://doi.org/10.1007/978-3-319-60916-4_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free