Noise free multi-armed bandit game

Atsuyoshi Nakamura; David P. Helmbold; Manfred K. Warmuth

Conference Proceedings

Noise free multi-armed bandit game

Nakamura A
Helmbold D
Warmuth M

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9618 412-423

DOI: 10.1007/978-3-319-30000-9_32

1Citations

4Readers

Get full text

Abstract

We study the loss version of adversarial multi-armed bandit problems with one lossless arm. We show an adversary’s strategy that forces any player to suffer K − 1 − O(1/T) loss where K is the number of arms and T is the number of rounds.

Author supplied keywords

Algorithmic learning
Bandit problem
Online learning

Cite

CITATION STYLE

APA

Nakamura, A., Helmbold, D. P., & Warmuth, M. K. (2016). Noise free multi-armed bandit game. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9618, pp. 412–423). Springer Verlag. https://doi.org/10.1007/978-3-319-30000-9_32

Noise free multi-armed bandit game

Abstract

Author supplied keywords

Cite

Register to see more suggestions