Multi-armed bandits with fairness constraints for distributing resources to human teammates

Abstract

How should a robot that collaborates with multiple people decide upon the distribution of resources (e.g., social attention or parts needed for an assembly)? People are uniquely attuned to how resources are distributed. A decision to distribute more resources to one team member than another might be perceived as unfair, with potentially detrimental effects on trust. We introduce a multi-armed bandit algorithm with fairness constraints, where a robot distributes resources to human teammates of different skill levels. In this problem, the robot does not know the skill level of each human teammate, but learns it by observing their performance over time. We define fairness as a constraint on the minimum rate at which each human teammate is selected throughout the task. We provide theoretical guarantees on performance and perform a large-scale user study, where we adjust the level of fairness in our algorithm. Results show that fairness in resource distribution has a significant effect on users' trust in the system.
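To make the fairness constraint concrete, the sketch below illustrates one way a minimum-selection-rate constraint can be layered on top of a standard UCB-style bandit. This is a minimal illustration assuming a UCB1 exploration rule and a Bernoulli reward model; it is not the authors' exact algorithm, and the names `FairUCB` and `min_rate` are hypothetical.

```python
"""Minimal sketch (not the paper's exact algorithm): a UCB-style bandit in
which each arm (human teammate) must be selected at a rate of at least
`min_rate` over the course of the task."""
import math
import random


class FairUCB:
    def __init__(self, n_arms, min_rate):
        # The constraint is only satisfiable if min_rate * n_arms <= 1.
        assert min_rate * n_arms <= 1.0
        self.n_arms = n_arms
        self.min_rate = min_rate
        self.counts = [0] * n_arms   # times each teammate was selected
        self.means = [0.0] * n_arms  # empirical success rate per teammate
        self.t = 0                   # total rounds played

    def select(self):
        self.t += 1
        # Fairness check: if a teammate's selection count would fall below
        # the minimum rate, select them regardless of estimated skill.
        for arm in range(self.n_arms):
            if self.counts[arm] == 0 or self.counts[arm] < self.min_rate * self.t:
                return arm
        # Otherwise, standard UCB1 exploration/exploitation.
        ucb = [
            self.means[a] + math.sqrt(2.0 * math.log(self.t) / self.counts[a])
            for a in range(self.n_arms)
        ]
        return max(range(self.n_arms), key=lambda a: ucb[a])

    def update(self, arm, reward):
        # Incremental update of the empirical mean for the selected teammate.
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]


if __name__ == "__main__":
    # Toy simulation: three teammates with skill levels unknown to the robot.
    skills = [0.9, 0.6, 0.3]
    bandit = FairUCB(n_arms=3, min_rate=0.2)
    for _ in range(1000):
        arm = bandit.select()
        reward = 1.0 if random.random() < skills[arm] else 0.0
        bandit.update(arm, reward)
    print("selection counts:", bandit.counts)
```

Under this sketch, raising `min_rate` trades expected task performance for a more even distribution of selections across teammates, which is the lever the user study varies.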

Citation (APA)
Claure, H., Chen, Y., Modi, J., Jung, M., & Nikolaidis, S. (2020). Multi-armed bandits with fairness constraints for distributing resources to human teammates. In ACM/IEEE International Conference on Human-Robot Interaction (pp. 299–308). IEEE Computer Society. https://doi.org/10.1145/3319502.3374806
