Bayesian Browsing Model

Chao Liu; Fan Guo; Christos Faloutsos

Journal Article

ACM Transactions on Knowledge Discovery from Data (2010) 4(4) 1-26

DOI: 10.1145/1857947.1857951

Abstract

A fundamental challenge in utilizing Web search click data is to infer user-perceived relevance from the search log. The inference is a difficult problem involving statistical reasoning, and the sheer size and continual growth of the log data impose additional scalability requirements. In this paper, we propose the Bayesian Browsing Model (BBM), which performs exact inference of document relevance, requires only a single pass over the data (i.e., optimal scalability), and is shown to be effective. We present two sets of experiments to evaluate the model's effectiveness and scalability. On the first set, comprising over 50 million search instances of 1.1 million distinct queries, BBM outperforms the state-of-the-art competitor by 29.2% in log-likelihood while being 57 times faster. On the second click log set, spanning a quarter of a petabyte, we showcase the scalability of BBM: we implemented it on a commercial MapReduce cluster, and it took only 3 hours to compute the relevance for 1.15 billion distinct query-URL pairs.

Citation (APA)

Liu, C., Guo, F., & Faloutsos, C. (2010). Bayesian Browsing Model. ACM Transactions on Knowledge Discovery from Data, 4(4), 1–26. https://doi.org/10.1145/1857947.1857951
