Efficient processing of top-k join queries by attribute domain refinement

Dragan Sahpaski; Aleksandar S. Dimovski; Goran Velinov; Margita Kon-Popovska

Conference Proceedings

Efficient processing of top-k join queries by attribute domain refinement

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7503 LNCS 318-331

DOI: 10.1007/978-3-642-33074-2_24

1Citations

5Readers

Get full text

Abstract

In this paper, we propose a new method for efficient processing of a top-k join query by its translation into a sequence of range queries, which are generated by performing iterative domain refinement of attributes included in the scoring function. In this process, we exploit the statistics for data distributions of the individual attributes, which in the form of histograms are available to an RDBMS. To improve the performance of our method, we use heuristic techniques to minimize the execution cost of range queries and the number of iterations. We use the PostgreSQL query engine optimizer to prove our theoretical results. We have done exhaustive set of experiments by exploiting different input parameters and by using cross checks to prove the results. We have applied our experiments to the TPC-H benchmark data sets, and the results we obtained confirm the efficiency of our approach. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Sahpaski, D., Dimovski, A. S., Velinov, G., & Kon-Popovska, M. (2012). Efficient processing of top-k join queries by attribute domain refinement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7503 LNCS, pp. 318–331). https://doi.org/10.1007/978-3-642-33074-2_24

Efficient processing of top-k join queries by attribute domain refinement

Abstract

Author supplied keywords

Cite

Register to see more suggestions