Strider: A hybrid adaptive distributed RDF stream processing engine

Xiangnan Ren; Olivier Curé

Conference ProceedingsOPEN ACCESS

Strider: A hybrid adaptive distributed RDF stream processing engine

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10587 LNCS 559-576

DOI: 10.1007/978-3-319-68288-4_33

16Citations

32Readers

Abstract

Real-time processing of data streams emanating from sensors is becoming a common task in Internet of Things scenarios. The key implementation goal consists in efficiently handling massive incoming data streams and supporting advanced data analytics services like anomaly detection. In an on-going, industrial project, a 24, /, 7 available stream processing engine usually faces dynamically changing data and workload characteristics. These changes impact the engine’s performance and reliability. We propose Strider, a hybrid adaptive distributed RDF Stream Processing engine that optimizes logical query plan according to the state of data streams. Strider has been designed to guarantee important industrial properties such as scalability, high availability, fault tolerance, high throughput and acceptable latency. These guarantees are obtained by designing the engine’s architecture with state-of-the-art Apache components such as Spark and Kafka. We highlight the efficiency (e.g., on a single machine machine, up, to 60x gain on throughput compared to state-of-the-art systems, a throughput of 3.1 million triples/second on a 9 machines cluster, a major breakthrough in this system’s category) of Strider on real-world and synthetic data sets.

Author supplied keywords

Cite

CITATION STYLE

APA

Ren, X., & Curé, O. (2017). Strider: A hybrid adaptive distributed RDF stream processing engine. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10587 LNCS, pp. 559–576). Springer Verlag. https://doi.org/10.1007/978-3-319-68288-4_33

Strider: A hybrid adaptive distributed RDF stream processing engine

Abstract

Author supplied keywords

Cite

Register to see more suggestions