A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing

Antonio Mallia; Michal Siedlaczek; Mengyang Sun; Torsten Suel

Conference ProceedingsOPEN ACCESS

A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing

International Conference on Information and Knowledge Management, Proceedings (2020) 2141-2144

DOI: 10.1145/3340531.3412080

19Citations

9Readers

Get full text

Abstract

In the top-k threshold estimation problem, given a query q, the goal is to estimate the score of the result at rank k. A good estimate of this score can result in significant performance improvements for several query processing scenarios, including selective search, index tiering, and widely used disjunctive query processing algorithms such as MaxScore, WAND, and BMW. Several approaches have been proposed, including parametric approaches, methods using random sampling, and a recent approach based on machine learning. However, previous work fails to perform any experimental comparison between these approaches. In this paper, we address this issue by reimplementing four major approaches and comparing them in terms of estimation error, running time, likelihood of an overestimate, and end-to-end performance when applied to common classes of disjunctive top-k query processing algorithms.

Author supplied keywords

Cite

CITATION STYLE

APA

Mallia, A., Siedlaczek, M., Sun, M., & Suel, T. (2020). A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing. In International Conference on Information and Knowledge Management, Proceedings (pp. 2141–2144). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412080

A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing

Abstract

Author supplied keywords

Cite

Register to see more suggestions