Variational adversarial kernel learned imitation learning

Fan Yang; Alina Vereshchaka; Yufan Zhou; Changyou Chen; Wen Dong

Conference ProceedingsOPEN ACCESS

Variational adversarial kernel learned imitation learning

AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (2020) 6599-6606

DOI: 10.1609/aaai.v34i04.6135

9Citations

12Readers

Abstract

Imitation learning refers to the problem where an agent learns to perform a task through observing and mimicking expert demonstrations, without knowledge of the cost function. State-of-the-art imitation learning algorithms reduce imitation learning to distribution-matching problems by minimizing some distance measures. However, the distance measure may not always provide informative signals for a policy update. To this end, we propose the variational adversarial kernel learned imitation learning (VAKLIL), which measures the distance using the maximum mean discrepancy with variational kernel learning. Our method optimizes over a large cost-function space and is sample efficient and robust to overfitting. We demonstrate the performance of our algorithm through benchmarking with four state-of-the-art imitation learning algorithms over five high-dimensional control tasks, and a complex transportation control task. Experimental results indicate that our algorithm significantly outperforms related algorithms in all scenarios.

Cite

CITATION STYLE

APA

Yang, F., Vereshchaka, A., Zhou, Y., Chen, C., & Dong, W. (2020). Variational adversarial kernel learned imitation learning. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 6599–6606). AAAI press. https://doi.org/10.1609/aaai.v34i04.6135

Variational adversarial kernel learned imitation learning

Abstract

Cite

Register to see more suggestions