How Expert Confidence Can Improve Collective Decision-Making in Contextual Multi-Armed Bandit Problems

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In collective decision-making (CDM) a group of experts with a shared set of values and a common goal must combine their knowledge to make a collectively optimal decision. Whereas existing research on CDM primarily focuses on making binary decisions, we focus here on CDM applied to solving contextual multi-armed bandit (CMAB) problems, where the goal is to exploit contextual information to select the best arm among a set. To address the limiting assumptions of prior work, we introduce confidence estimates and propose a novel approach to deciding with expert advice which can take advantage of these estimates. We further show that, when confidence estimates are imperfect, the proposed approach is more robust than the classical confidence-weighted majority vote.

Cite

CITATION STYLE

APA

Abels, A., Lenaerts, T., Trianni, V., & Nowé, A. (2020). How Expert Confidence Can Improve Collective Decision-Making in Contextual Multi-Armed Bandit Problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12496 LNAI, pp. 125–138). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-63007-2_10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free