Scalability is promoted as a key quality feature of modern big data stream processing engines. However, even though research made huge efforts to provide precise definitions and corresponding metrics for the term scalability, experimental scalability evaluations or benchmarks of stream processing engines apply different and inconsistent metrics. With this paper, we aim to establish general metrics for scalability of stream processing engines. Derived from common definitions of scalability in cloud computing, we propose two metrics: a load capacity function and a resource demand function. Both metrics relate provisioned resources and load intensities, while requiring specific service level objectives to be fulfilled. We show how these metrics can be employed for scalability benchmarking and discuss their advantages in comparison to other metrics, used for stream processing engines and other software systems.
CITATION STYLE
Henning, S., & Hasselbring, W. (2021). How to measure scalability of distributed stream processing engines? In ICPE 2021 - Companion of the ACM/SPEC International Conference on Performance Engineering (pp. 85–88). Association for Computing Machinery, Inc. https://doi.org/10.1145/3447545.3451190
Mendeley helps you to discover research relevant for your work.