Robots can use information from people to improve learning speed or quality. However, people can have short attention spans and misunderstand tasks. Our work addresses these issues with algorithms for learning from inattentive teachers that take advantage of feedback when people are present, and an algorithm for learning from inaccurate teachers that estimates which state-action pairs receive incorrect feedback. These advances will enhance robots' ability to take advantage of imperfect feedback from human teachers.
CITATION STYLE
Kessler Faulkner, T. A., & Thomaz, A. (2021). Interactive reinforcement learning from imperfect teachers. In ACM/IEEE International Conference on Human-Robot Interaction (pp. 577–579). IEEE Computer Society. https://doi.org/10.1145/3434074.3446361
Mendeley helps you to discover research relevant for your work.