Learning from demonstrations with high-level side information

Min Wen; Ivan Papusha; Ufuk Topcu

Conference Proceedings

Learning from demonstrations with high-level side information

IJCAI International Joint Conference on Artificial Intelligence (2017) 0 3055-3061

DOI: 10.24963/ijcai.2017/426

26Citations

29Readers

Get full text

Abstract

We consider the problem of learning from demonstration, where extra side information about the demonstration is encoded as a co-safe linear temporal logic formula. We address two known limitations of existing methods that do not account for such side information. First, the policies that result from existing methods, while matching the expected features or likelihood of the demonstrations, may still be in conflict with high-level objectives not explicit in the demonstration trajectories. Second, existing methods fail to provide a priori guarantees on the out-of-sample generalization performance with respect to such high-level goals. This lack of formal guarantees can prevent the application of learning from demonstration to safety-critical systems, especially when inference to state space regions with poor demonstration coverage is required. In this work, we show that side information, when explicitly taken into account, indeed improves the performance and safety of the learned policy with respect to task implementation. Moreover, we describe an automated procedure to systematically generate the features that encode side information expressed in temporal logic.

Cite

CITATION STYLE

APA

Wen, M., Papusha, I., & Topcu, U. (2017). Learning from demonstrations with high-level side information. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 3055–3061). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/426

Learning from demonstrations with high-level side information

Abstract

Cite

Register to see more suggestions