Problem decomposition for behavioural cloning

6Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In behavioural cloning of the human operator’s skill, a controller is usually induced directly as a classifier from system’s states into actions. Experience shows that this often results in brittle controllers. In this paper we explore a decomposition of the cloning problem into two learning problems: the learning of operator’s control trajectories and the learning of the system’s dynamics separately. We analyse advantages of such indirect controllers. We give characterization of the learner’s error that is plausible explanation of why this decomposition approach has empirically proved to be usually superior to direct cloning.

Cite

CITATION STYLE

APA

Šuc, D., & Bratko, I. (2000). Problem decomposition for behavioural cloning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1810, pp. 382–391). Springer Verlag. https://doi.org/10.1007/3-540-45164-1_39

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free