Abstract
We study dynamic allocation problems for discrete time multi-armed bandits under uncertainty, based on the the theory of nonlinear expectations. We show that, under independence assumption on the bandits and with some relaxation in the definition of optimality, a Gittins allocation index gives optimal choices. This involves studying the interaction of our uncertainty with controls which determine the filtration. We also run a simple numerical example which illustrates the interaction between the willingness to explore and uncertainty aversion of the agent when making decisions.
Author supplied keywords
Cite
CITATION STYLE
Cohen, S. N., & Treetanthiploet, T. (2022). Gittins’ theorem under uncertainty. Electronic Journal of Probability, 27. https://doi.org/10.1214/22-EJP742
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.