Linear submodular bandits with a knapsack constraint

18Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.

Abstract

Linear submodular bandits has been proven to be effective in solving the diversification and feature-based exploration problems in retrieval systems. Concurrently, many web-based applications, such as news article recommendation and online ad placement, can be modeled as budget-limited problems. However, the diversification problem under a budget constraint has not been considered. In this paper, we first introduce the budget constraint to linear submodular bandits as a new problem called the linear submodular bandits with a knapsack constraint. We then define an α-approximation unit-cost regret considering that submodular function maximization is NP-hard. To solve this problem, we propose two greedy algorithms based on a modified UCB rule.We then prove these two algorithms with different regret bounds and computational costs. We also conduct a number of experiments and the experimental results confirm our theoretical analyses.

Cite

CITATION STYLE

APA

Yu, B., Fang, M., & Tao, D. (2016). Linear submodular bandits with a knapsack constraint. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 1380–1386). AAAI press. https://doi.org/10.1609/aaai.v30i1.10154

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free