In this paper, we propose a new method for efficient processing of a top-k join query by its translation into a sequence of range queries, which are generated by performing iterative domain refinement of attributes included in the scoring function. In this process, we exploit the statistics for data distributions of the individual attributes, which in the form of histograms are available to an RDBMS. To improve the performance of our method, we use heuristic techniques to minimize the execution cost of range queries and the number of iterations. We use the PostgreSQL query engine optimizer to prove our theoretical results. We have done exhaustive set of experiments by exploiting different input parameters and by using cross checks to prove the results. We have applied our experiments to the TPC-H benchmark data sets, and the results we obtained confirm the efficiency of our approach. © 2012 Springer-Verlag.
CITATION STYLE
Sahpaski, D., Dimovski, A. S., Velinov, G., & Kon-Popovska, M. (2012). Efficient processing of top-k join queries by attribute domain refinement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7503 LNCS, pp. 318–331). https://doi.org/10.1007/978-3-642-33074-2_24
Mendeley helps you to discover research relevant for your work.