Verifiable and interpretable reinforcement learning through program synthesis

Abhinav Verma

Conference Proceedings, Open Access

33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (2019) 9902-9903

DOI: 10.1609/aaai.v33i01.33019902

Abstract

We study the problem of generating interpretable and verifiable policies for Reinforcement Learning (RL). Unlike the popular Deep Reinforcement Learning (DRL) paradigm, in which the policy is represented by a neural network, this work aims to find policies that can be represented in high-level programming languages. Such programmatic policies have several benefits: they are more easily interpreted than neural networks, and they are amenable to verification by scalable symbolic methods. The methods used to generate programmatic policies also provide a mechanism for systematically using domain knowledge to guide the policy search. The interpretability and verifiability of these policies provide the opportunity to deploy RL-based solutions in safety-critical environments. This thesis draws on, and extends, work from both the machine learning and formal methods communities.
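As a rough illustration of what a policy "represented in a high-level programming language" might look like, the minimal Python sketch below writes a policy as a short, readable function and checks one simple property of it. Everything here is an assumption made for illustration and is not taken from the thesis: the two-variable state (position, velocity), the gains 1.5 and 0.5, and the names `policy` and `check_action_bounds`. The check enumerates a finite grid of states, which is ordinary testing and only a stand-in for the symbolic verification methods the abstract refers to.

```python
from itertools import product


def policy(position: float, velocity: float) -> float:
    """Hypothetical programmatic policy: a clipped proportional-derivative rule.

    The gains 1.5 and 0.5 are illustrative assumptions, not learned values.
    """
    raw_action = -1.5 * position - 0.5 * velocity
    # Clip the action to the assumed actuator range [-1, 1].
    return max(-1.0, min(1.0, raw_action))


def check_action_bounds(policy_fn, grid, low=-1.0, high=1.0) -> bool:
    """Check that actions stay in [low, high] on every grid state.

    This is exhaustive testing over a finite grid, not a symbolic proof.
    """
    return all(low <= policy_fn(p, v) <= high for p, v in product(grid, grid))


# Assumed state grid: positions and velocities from -2.0 to 2.0 in steps of 0.1.
grid = [i / 10.0 for i in range(-20, 21)]
bounds_ok = check_action_bounds(policy, grid)
```

Because the whole policy is two lines of arithmetic, a reader can see directly how each state variable affects the action, which is the kind of interpretability the abstract contrasts with a neural network policy.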

Citation (APA)

Verma, A. (2019). Verifiable and interpretable reinforcement learning through program synthesis. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 9902–9903). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33019902
