De-Biased Modeling of Search Click Behavior with Reinforcement Learning

Jianghong Zhou; Sayyed M. Zahiri; Simon Hughes; Khalifeh Al Jadda; Surya Kallumadi; Eugene Agichtein

Conference Proceedings (Open Access)

SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (2021) 1637-1641

DOI: 10.1145/3404835.3463228

Abstract

Users' clicks on Web search results are one of the key signals for evaluating and improving Web search quality and are widely used in current state-of-the-art Learning-To-Rank (LTR) models. With large volumes of search logs available for major search engines, effective models of searcher click behavior have emerged to evaluate and train LTR models. However, any model of users' click behavior must account for the bias in that behavior. In particular, when a search result is not clicked, the user has not necessarily judged it irrelevant; the result may simply have been missed, especially if it was ranked low. Such biases in the click log data can be absorbed into the click models, propagating the errors to the resulting LTR ranking models or evaluation metrics. In this paper, we propose the De-biased Reinforcement Learning Click model (DRLC). The DRLC model relaxes assumptions made by previous click models about users' examination behavior and the resulting latent states. To implement the DRLC model, convolutional neural networks are used as the value networks for reinforcement learning and are trained to learn a policy that reduces bias in the click logs. To demonstrate the effectiveness of the DRLC model, we first compare its performance with previous state-of-the-art approaches using established click prediction metrics, including log-likelihood and perplexity. We further show that DRLC also improves ranking performance. Our experiments demonstrate that the DRLC model learns to reduce bias in click logs, which leads to improved modeling performance and shows the potential of DRLC for improving Web search quality.
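The abstract names log-likelihood and perplexity as the click prediction metrics. As a minimal sketch of how these metrics are commonly defined in the click-modeling literature (not the authors' evaluation code), the following assumes each session is a list of binary clicks per rank and that a click model has already produced one predicted click probability per rank. The function names, the data layout, and the `eps` clipping constant are illustrative assumptions.

```python
import math


def _clip(q, eps):
    # Keep probabilities away from 0 and 1 so the logarithms stay finite.
    return min(max(q, eps), 1.0 - eps)


def log_likelihood(clicks, probs, eps=1e-12):
    """Mean per-session log-likelihood of the observed clicks.

    clicks: list of sessions, each a list of 0/1 clicks by rank (assumed layout).
    probs:  same shape, predicted click probability for each rank.
    """
    total = 0.0
    for session_clicks, session_probs in zip(clicks, probs):
        for c, q in zip(session_clicks, session_probs):
            q = _clip(q, eps)
            total += math.log(q) if c else math.log(1.0 - q)
    return total / len(clicks)


def perplexity_at_rank(clicks, probs, rank, eps=1e-12):
    """Perplexity at one rank: 2 ** (-mean log2-probability of the observed outcome)."""
    total = 0.0
    for session_clicks, session_probs in zip(clicks, probs):
        q = _clip(session_probs[rank], eps)
        total += math.log2(q) if session_clicks[rank] else math.log2(1.0 - q)
    return 2.0 ** (-total / len(clicks))


def perplexity(clicks, probs):
    """Overall perplexity: average of the per-rank perplexities."""
    n_ranks = len(clicks[0])
    return sum(perplexity_at_rank(clicks, probs, r) for r in range(n_ranks)) / n_ranks
```

Under these definitions, a perfect model has perplexity 1 and a model that predicts 0.5 everywhere has perplexity 2; higher log-likelihood and lower perplexity indicate better click prediction.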

Cite

Zhou, J., Zahiri, S. M., Hughes, S., Al Jadda, K., Kallumadi, S., & Agichtein, E. (2021). De-Biased Modeling of Search Click Behavior with Reinforcement Learning. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1637–1641). Association for Computing Machinery, Inc. https://doi.org/10.1145/3404835.3463228
