An Empirical Study of Selection Bias in Pinterest Ads Retrieval

7Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data selection bias has been a long-lasting challenge in the machine learning domain, especially in multi-stage recommendation systems, where the distribution of labeled items for model training is very different from that of the actual candidates during inference time. This distribution shift is even more prominent in the context of online advertising where the user base is diverse and the platform contains a wide range of contents. In this paper, we first investigate the data selection bias in the upper funnel (Ads Retrieval) of Pinterest's multi-cascade ads ranking system. We then conduct comprehensive experiments to assess the performance of various state-of-the-art methods, including transfer learning, adversarial learning, and unsupervised domain adaptation. Moreover, we further introduce some modifications into the unsupervised domain adaptation and evaluate the performance of different variants of this modified method. Our online A/B experiments show that the modified version of unsupervised domain adaptation (MUDA) could provide the largest improvements to the performance of Pinterest's advertisement ranking system compared with other methods and the one used in current production.

Cite

CITATION STYLE

APA

Wang, Y., Yin, P., Tao, Z., Venkatesan, H., Lai, J., Fang, Y., & Xiao, P. J. (2023). An Empirical Study of Selection Bias in Pinterest Ads Retrieval. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 5174–5183). Association for Computing Machinery. https://doi.org/10.1145/3580305.3599771

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free