Reinforcing Pretrained Models for Generating Attractive Text Advertisements

25Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We study how pretrained language models can be enhanced by using deep reinforcement learning to generate attractive text advertisements that reach the high quality standard of real-world advertiser mediums. To improve ad attractiveness without hampering user experience, we propose a model-based reinforcement learning framework for text ad generation, which constructs a model for the environment dynamics and avoids large sample complexity. Based on the framework, we develop Masked-Sequence Policy Gradient, a reinforcement learning algorithm that integrates efficiently with pretrained models and explores the action space effectively. Our method has been deployed to production in Microsoft Bing. Automatic offline experiments, human evaluation, and online experiments demonstrate the superior performance of our method.

Cite

CITATION STYLE

APA

Wang, X., Gu, X., Cao, J., Zhao, Z., Yan, Y., Middha, B., & Xie, X. (2021). Reinforcing Pretrained Models for Generating Attractive Text Advertisements. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 3697–3707). Association for Computing Machinery. https://doi.org/10.1145/3447548.3467105

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free