Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems

Abstract

In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be fundamental to understanding how possibly conflicting objectives are managed, which helps to interpret the demonstrated behavior. In this paper, we discuss how inverse reinforcement learning (IRL) can be employed to retrieve the reward function implicitly optimized by expert agents acting in real applications. Scaling IRL to real-world cases has proved challenging because typically only a fixed dataset of demonstrations is available and further interactions with the environment are not allowed. For this reason, we resort to a class of truly batch model-free IRL algorithms and present three application scenarios: (1) high-level decision-making in highway driving, (2) inferring user preferences in a social network (Twitter), and (3) managing the water release of Lake Como. For each of these scenarios, we provide a formalization, experiments, and a discussion interpreting the obtained results.

Cite

Likmeta, A., Metelli, A. M., Ramponi, G., Tirinzoni, A., Giuliani, M., & Restelli, M. (2021). Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems. Machine Learning, 110(9), 2541–2576. https://doi.org/10.1007/s10994-020-05939-8
