Distributed in-memory analytics for big temporal data

Bin Yao; Wei Zhang; Zhi Jie Wang; Zhongpu Chen; Shuo Shang; Kai Zheng; Minyi Guo

Conference Proceedings

Distributed in-memory analytics for big temporal data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10827 LNCS 549-565

DOI: 10.1007/978-3-319-91452-7_36

3Citations

5Readers

Get full text

Abstract

The temporal data is ubiquitous, and massive amount of temporal data is generated nowadays. Management of big temporal data is important yet challenging. Processing big temporal data using a distributed system is a desired choice. However, existing distributed systems/methods either cannot support native queries, or are disk-based solutions, which could not well satisfy the requirements of high throughput and low latency. To alleviate this issue, this paper proposes an In-memory based Two-level Index Solution in Spark (ITISS) for processing big temporal data. The framework of our system is easy to understand and implement, but without loss of efficiency. We conduct extensive experiments to verify the performance of our solution. Experimental results based on both real and synthetic datasets consistently demonstrate that our solution is efficient and competitive.

Author supplied keywords

Cite

CITATION STYLE

APA

Yao, B., Zhang, W., Wang, Z. J., Chen, Z., Shang, S., Zheng, K., & Guo, M. (2018). Distributed in-memory analytics for big temporal data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10827 LNCS, pp. 549–565). Springer Verlag. https://doi.org/10.1007/978-3-319-91452-7_36

Distributed in-memory analytics for big temporal data

Abstract

Author supplied keywords

Cite

Register to see more suggestions