Safety-aware apprenticeship learning

Weichao Zhou; Wenchao Li

Conference ProceedingsOPEN ACCESS

Safety-aware apprenticeship learning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10981 LNCS 662-680

DOI: 10.1007/978-3-319-96145-3_38

24Citations

24Readers

Abstract

Apprenticeship learning (AL) is a kind of Learning from Demonstration techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert’s demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.

Cite

CITATION STYLE

APA

Zhou, W., & Li, W. (2018). Safety-aware apprenticeship learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10981 LNCS, pp. 662–680). Springer Verlag. https://doi.org/10.1007/978-3-319-96145-3_38

Safety-aware apprenticeship learning

Abstract

Cite

Register to see more suggestions