With data explosion in recent years, timely and cost-effective analytics over large scale data has been a hotspot of data management research. Join is an important operation in database query. However, data skew happens naturally in many applications, which will severely degrade the performance of most join algorithms. To address this problem, this paper introduces an Adaptive Skew Insensitive(ASI) join algorithm to handle with serious data skew. Based on our cost analysis, ASI join algorithm can adaptively choose the best join algorithm for different inputs. Compared with several state-of-the-art join methods through adequate experiments, our method achieves significant improvement of join efficiency dealing with data skew. © 2014 Springer International Publishing Switzerland.
CITATION STYLE
Liao, W., Wang, T., Li, H., Yang, D., Qiu, Z., & Lei, K. (2014). An adaptive skew insensitive join algorithm for large scale data analytics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8709 LNCS, pp. 494–502). Springer Verlag. https://doi.org/10.1007/978-3-319-11116-2_44
Mendeley helps you to discover research relevant for your work.