Distributed joins have gained importance in the past decade, mainly due to the increased number of available data sources on the Internet. In this work we extend Bloomjoin, the state of the art algorithm for distributed joins, so that it minimizes the network usage for the query execution based on database statistics. We present 4 extensions of the algorithm, and construct a query optimizer for selecting the best extension for each query. Our theoretical analysis and experimental evaluation shows significant network cost savings compared to the original Bloomjoin algorithm. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Ramesh, S., Papapetrou, O., & Siberski, W. (2008). Optimizing distributed joins with bloom filters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5375 LNCS, pp. 145–156). https://doi.org/10.1007/978-3-540-89737-8_15
Mendeley helps you to discover research relevant for your work.