A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing

12Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In the top-k threshold estimation problem, given a query q, the goal is to estimate the score of the result at rank k. A good estimate of this score can result in significant performance improvements for several query processing scenarios, including selective search, index tiering, and widely used disjunctive query processing algorithms such as MaxScore, WAND, and BMW. Several approaches have been proposed, including parametric approaches, methods using random sampling, and a recent approach based on machine learning. However, previous work fails to perform any experimental comparison between these approaches. In this paper, we address this issue by reimplementing four major approaches and comparing them in terms of estimation error, running time, likelihood of an overestimate, and end-to-end performance when applied to common classes of disjunctive top-k query processing algorithms.

Cite

CITATION STYLE

APA

Mallia, A., Siedlaczek, M., Sun, M., & Suel, T. (2020). A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing. In International Conference on Information and Knowledge Management, Proceedings (pp. 2141–2144). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412080

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free