Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifier framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a lower bound on the probability of the property being satisfied, our framework jointly learns a control policy and a formal certificate to ensure the satisfaction of the property with a desired probability threshold. Both the control policy and the formal certificate are continuous functions from states to reals, which are learned as parameterized neural networks. While in the deterministic case, the certificates are invariant and barrier functions for safety, or Lyapunov and ranking functions for liveness, in the stochastic case the certificates are supermartingales. For certificate verification, we use interval arithmetic abstract interpretation to bound the expected values of neural network functions.
CITATION STYLE
Chatterjee, K., Henzinger, T. A., Lechner, M., & Žikelić, Đ. (2023). A Learner-Verifier Framework for Neural Network Controllers and Certificates of Stochastic Systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13993 LNCS, pp. 3–25). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-30823-9_1
Mendeley helps you to discover research relevant for your work.