Reinforcing Pretrained Models for Generating Attractive Text Advertisements

Xiting Wang; Xinwei Gu; Jie Cao; Zihua Zhao; Yulan Yan; Bhuvan Middha; Xing Xie

Conference Proceedings (Open Access)

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2021) 3697-3707

DOI: 10.1145/3447548.3467105

Abstract

We study how pretrained language models can be enhanced by using deep reinforcement learning to generate attractive text advertisements that reach the high quality standard of real-world advertiser mediums. To improve ad attractiveness without hampering user experience, we propose a model-based reinforcement learning framework for text ad generation, which constructs a model for the environment dynamics and avoids large sample complexity. Based on the framework, we develop Masked-Sequence Policy Gradient, a reinforcement learning algorithm that integrates efficiently with pretrained models and explores the action space effectively. Our method has been deployed to production in Microsoft Bing. Automatic offline experiments, human evaluation, and online experiments demonstrate the superior performance of our method.

Citation (APA)

Wang, X., Gu, X., Cao, J., Zhao, Z., Yan, Y., Middha, B., & Xie, X. (2021). Reinforcing Pretrained Models for Generating Attractive Text Advertisements. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 3697–3707). Association for Computing Machinery. https://doi.org/10.1145/3447548.3467105
