Secure Best Arm Identification in Multi-armed Bandits

Radu Ciucanu; Pascal Lafourcade; Marius Lombard-Platet; Marta Soare

Conference Proceedings

Secure Best Arm Identification in Multi-armed Bandits

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11879 LNCS 152-171

DOI: 10.1007/978-3-030-34339-2_9

8Citations

85Readers

Get full text

Abstract

The stochastic multi-armed bandit is a classical decision making model, where an agent repeatedly chooses an action (pull a bandit arm) and the environment responds with a stochastic outcome (reward) coming from an unknown distribution associated with the chosen action. A popular objective for the agent is that of identifying the arm with the maximum expected reward, also known as the best-arm identification problem. We address the inherent privacy concerns that occur in a best-arm identification problem when outsourcing the data and computations to a honest-but-curious cloud. Our main contribution is a distributed protocol that computes the best arm while guaranteeing that (i) no cloud node can learn at the same time information about the rewards and about the arms ranking, and (ii) by analyzing the messages communicated between the different cloud nodes, no information can be learned about the rewards or about the ranking. In other words, the two properties ensure that the protocol has no security single point of failure. We rely on the partially homomorphic property of the well-known Paillier’s cryptosystem as a building block in our protocol. We prove the correctness of our protocol and we present proof-of-concept experiments suggesting its practical feasibility.

Author supplied keywords

Cite

CITATION STYLE

APA

Ciucanu, R., Lafourcade, P., Lombard-Platet, M., & Soare, M. (2019). Secure Best Arm Identification in Multi-armed Bandits. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11879 LNCS, pp. 152–171). Springer. https://doi.org/10.1007/978-3-030-34339-2_9

Secure Best Arm Identification in Multi-armed Bandits

Abstract

Author supplied keywords

Cite

Register to see more suggestions