SSCJ: A semi-stream cache join using a front-stage cache module

8Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Semi-stream processing has become an emerging area of research in the field of data stream management. One common operation in semi-stream processing is joining a stream with disk-based master data using a join operator. This join operator typically works under limited main memory and this memory is generally not large enough to hold the whole disk-based master data. Recently, a number of semi-stream join algorithms have been proposed in the literature to achieve an optimal performance but still there is room to improve the performance. In this paper we propose a novel Semi-Stream Cache Join (SSCJ) using a front-stage cache module. The algorithm takes advantage of skewed distributions, and we present results for Zipfian distributions of the type that appear in many applications. We analyze the performance of SSCJ with a well known related join algorithm, HYBRIDJOIN (Hybrid Join). We also provide the cost model for our approach and validate it with experiments. © 2013 Springer-Verlag GmbH.

Cite

CITATION STYLE

APA

Naeem, M. A., Weber, G., Dobbie, G., & Lutteroth, C. (2013). SSCJ: A semi-stream cache join using a front-stage cache module. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8057 LNCS, pp. 236–247). https://doi.org/10.1007/978-3-642-40131-2_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free