Reinforcement learning has had great empirical success in many domains, while its theoretical foundations, such as performance guarantees, have lagged behind. The usual guarantee of asymptotic convergence to an optimal policy is not strong enough for real-world applications. Meta-learning algorithms aim to use experience from multiple tasks to improve performance on each individual task and to reduce the time taken to reach an acceptable policy. This paper proposes to study the provable properties of meta-reinforcement learning.
CITATION STYLE
Mahony, A. (2018). Formalising performance guarantees in meta-reinforcement learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11232 LNCS, pp. 469–472). Springer Verlag. https://doi.org/10.1007/978-3-030-02450-5_37