In the top-k threshold estimation problem, given a query q, the goal is to estimate the score of the result at rank k. A good estimate of this score can result in significant performance improvements for several query processing scenarios, including selective search, index tiering, and widely used disjunctive query processing algorithms such as MaxScore, WAND, and BMW. Several approaches have been proposed, including parametric approaches, methods using random sampling, and a recent approach based on machine learning. However, previous work fails to perform any experimental comparison between these approaches. In this paper, we address this issue by reimplementing four major approaches and comparing them in terms of estimation error, running time, likelihood of an overestimate, and end-to-end performance when applied to common classes of disjunctive top-k query processing algorithms.
CITATION STYLE
Mallia, A., Siedlaczek, M., Sun, M., & Suel, T. (2020). A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing. In International Conference on Information and Knowledge Management, Proceedings (pp. 2141–2144). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412080
Mendeley helps you to discover research relevant for your work.