Adaptive supervisor: Method of reinforcement learning fault elimination by application of supervised learning

6Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Reinforcement Learning (RL) is a popular approach for solving increasing number of problems. However, standard RL approach has many deficiencies. In this paper multiple approaches for addressing those deficiencies by incorporating Supervised Learning are discussed and a new approach, Reinforcement Learning with Adaptive Supervisor, is proposed. In this model, actions chosen by the RL method are rated by the supervisor and may be replaced with safer ones. The supervisor observes the results of each action and on that basis it learns the knowledge about safety of actions in various states. It helps to overcome one of the Reinforcement Learning deficiencies-risk of wrong action execution. The new approach is designed for domains, where failures are very expensive. The architecture was evaluated on a car intersection model. The proposed method eliminated around 50% of failures.

Cite

CITATION STYLE

APA

Krzvsztoń, M. (2018). Adaptive supervisor: Method of reinforcement learning fault elimination by application of supervised learning. In Proceedings of the 2018 Federated Conference on Computer Science and Information Systems, FedCSIS 2018 (pp. 139–143). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.15439/2018F236

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free