Towards Practical Lipschitz Bandits

11Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Stochastic Lipschitz bandit algorithms balance exploration and exploitation, and have been used for a variety of important task domains. In this paper, we present a framework for Lipschitz bandit methods that adaptively learns partitions of context-and arm-space. Due to this flexibility, the algorithm is able to efficiently optimize rewards and minimize regret, by focusing on the portions of the space that are most relevant. In our analysis, we link tree-based methods to Gaussian processes. In light of our analysis, we design a novel hierarchical Bayesian model for Lipschitz bandit problems. Our experiments show that our algorithms can achieve state-of-the-art performance in challenging real-world tasks such as neural network hyperparameter tuning.

Cite

CITATION STYLE

APA

Wang, T., Ye, W., Geng, D., & Rudin, C. (2020). Towards Practical Lipschitz Bandits. In FODS 2020 - Proceedings of the 2020 ACM-IMS Foundations of Data Science Conference (pp. 129–138). Association for Computing Machinery, Inc. https://doi.org/10.1145/3412815.3416885

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free