How to measure scalability of distributed stream processing engines?

15Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Scalability is promoted as a key quality feature of modern big data stream processing engines. However, even though research made huge efforts to provide precise definitions and corresponding metrics for the term scalability, experimental scalability evaluations or benchmarks of stream processing engines apply different and inconsistent metrics. With this paper, we aim to establish general metrics for scalability of stream processing engines. Derived from common definitions of scalability in cloud computing, we propose two metrics: a load capacity function and a resource demand function. Both metrics relate provisioned resources and load intensities, while requiring specific service level objectives to be fulfilled. We show how these metrics can be employed for scalability benchmarking and discuss their advantages in comparison to other metrics, used for stream processing engines and other software systems.

Cite

CITATION STYLE

APA

Henning, S., & Hasselbring, W. (2021). How to measure scalability of distributed stream processing engines? In ICPE 2021 - Companion of the ACM/SPEC International Conference on Performance Engineering (pp. 85–88). Association for Computing Machinery, Inc. https://doi.org/10.1145/3447545.3451190

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free