Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank

39Citations
Citations of this article
27Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Unbiased counterfactual learning to rank (CLTR) requires click propensities to compensate for the difference between user clicks and true relevance of search results via inverse propensity scoring (IPS). Current propensity estimation methods assume that user click behavior follows the position-based click model (PBM) and estimate click propensities based on this assumption. However, in reality, user clicks often follow the cascade model (CM), where users scan search results from top to bottom and where each next click depends on the previous one. In this cascade scenario, PBM-based estimates of propensities are not accurate, which, in turn, hurts CLTR performance. In this paper, we propose a propensity estimation method for the cascade scenario, called cascade model-based inverse propensity scoring (CM-IPS). We show that CM-IPS keeps CLTR performance close to the full-information performance in case the user clicks follow the CM, while PBM-based CLTR has a significant gap towards the full-information. The opposite is true if the user clicks follow PBM instead of the CM. Finally, we suggest a way to select between CM-and PBM-based propensity estimation methods based on historical user clicks.

Cite

CITATION STYLE

APA

Vardasbi, A., De Rijke, M., & Markov, I. (2020). Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2089–2092). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401299

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free