Cost of fault-tolerance on data stream processing

Valerio Vianello; Marta Patiño-Martínez; Ainhoa Azqueta-Alzúaz; Ricardo Jimenez-Péris

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 11339 LNCS, pp. 17-27 (2019)

DOI: 10.1007/978-3-030-10549-5_2

Abstract

Data streaming engines process data on the fly, in contrast to databases, which first store the data and then process it. To handle the increasing amount of data produced every day, data streaming engines run on top of distributed systems, where failures are likely to happen. Current distributed data streaming engines such as Apache Flink provide fault tolerance. In this paper, we evaluate the performance impact of Flink's fault tolerance mechanisms on a distributed system, both during regular operation (when there are no failures) and when failures occur. We use Intel HiBench to conduct the evaluation.

Citation (APA)

Vianello, V., Patiño-Martínez, M., Azqueta-Alzúaz, A., & Jimenez-Péris, R. (2019). Cost of fault-tolerance on data stream processing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11339 LNCS, pp. 17–27). Springer Verlag. https://doi.org/10.1007/978-3-030-10549-5_2
