Improving real-time bidding using a constrained Markov decision process

12 citations · 17 readers (Mendeley)

Abstract

Online advertising is increasingly switching to real-time bidding on advertisement inventory, in which ad slots are sold through real-time auctions as users visit websites or use mobile apps. To compete with unknown bidders in such a highly stochastic environment, each bidder must estimate the value of each impression and set a competitive bid price. Previous bidding algorithms have done so without considering the constraint of budget limits, which we address in this paper. We model the bidding process as a reinforcement learning problem based on a Constrained Markov Decision Process (CMDP). Our model uses the predicted click-through rate as the state, the bid price as the action, and ad clicks as the reward. We propose a bidding function that outperforms state-of-the-art bidding functions in terms of the number of clicks when the budget limit is low. We further simulate different bidding functions competing in the same environment and report how the bidding strategies perform when required to adapt to a dynamic environment.
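The CMDP framing described above (state = predicted click-through rate, action = bid price, reward = clicks, subject to a budget constraint) can be sketched in code. The following is a minimal illustrative simulation, not the authors' actual algorithm: it uses a simple bandit-style value update rather than the paper's bidding function, discretizes the CTR into buckets, enforces the budget as a hard feasibility constraint on actions, and pits the bidder against a random competitor in second-price auctions. All class and function names are assumptions made for this sketch.

```python
import random


class BudgetedBidder:
    """Toy CMDP-style bidder: picks a bid price given a predicted CTR,
    learns estimated click value per (CTR bucket, bid), and never bids
    more than its remaining budget (the constraint)."""

    def __init__(self, budget, bid_levels, n_states=10, epsilon=0.1):
        self.budget = budget          # total spend allowed (the constraint)
        self.spent = 0.0
        self.bid_levels = bid_levels  # discretized action space (bid prices)
        self.n_states = n_states      # number of CTR buckets (states)
        self.epsilon = epsilon        # exploration rate
        # q[s][a]: estimated click reward for bidding bid_levels[a] in bucket s
        self.q = [[0.0] * len(bid_levels) for _ in range(n_states)]

    def _state(self, ctr):
        # Map a predicted CTR in [0, 1) to a discrete state bucket.
        return min(int(ctr * self.n_states), self.n_states - 1)

    def act(self, ctr, rng):
        """Choose a bid (action index) or None if the budget is exhausted.
        The budget constraint is enforced by restricting to affordable bids."""
        feasible = [i for i, b in enumerate(self.bid_levels)
                    if b <= self.budget - self.spent]
        if not feasible:
            return None
        if rng.random() < self.epsilon:
            return rng.choice(feasible)          # explore
        s = self._state(ctr)
        return max(feasible, key=lambda i: self.q[s][i])  # exploit

    def observe(self, ctr, action, won, price, clicked, alpha=0.1):
        """Pay the clearing price on a win and nudge the value estimate
        toward the observed click reward."""
        if won:
            self.spent += price
        s = self._state(ctr)
        reward = 1.0 if clicked else 0.0
        self.q[s][action] += alpha * (reward - self.q[s][action])


def run_auctions(bidder, n, seed=0):
    """Simulate n second-price auctions against a random competitor;
    returns the total number of clicks won."""
    rng = random.Random(seed)
    clicks = 0
    for _ in range(n):
        ctr = rng.random() * 0.1             # predicted CTR for this impression
        a = bidder.act(ctr, rng)
        if a is None:
            break                            # budget exhausted: stop bidding
        bid = bidder.bid_levels[a]
        competitor = rng.uniform(0.0, 2.0)   # unknown highest competing bid
        won = bid > competitor
        price = competitor if won else 0.0   # second-price: pay competitor's bid
        clicked = won and rng.random() < ctr
        clicks += clicked
        bidder.observe(ctr, a, won, price, clicked)
    return clicks
```

Because the winner pays the (lower) second price and only affordable bids are feasible, total spend can never exceed the budget, which is the hard-constraint interpretation of the CMDP; the paper's approach is more sophisticated, but the state/action/reward plumbing is the same.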

Citation (APA)

Du, M., Sassioui, R., Varisteas, G., State, R., Brorsson, M., & Cherkaoui, O. (2017). Improving real-time bidding using a constrained Markov decision process. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10604 LNAI, pp. 711–726). Springer Verlag. https://doi.org/10.1007/978-3-319-69179-4_50
