Evaluating model serving strategies over streaming data

Sonia Horchidan; Emmanouil Kritharakis; Vasiliki Kalavri; Paris Carbone

Conference ProceedingsOPEN ACCESS

Evaluating model serving strategies over streaming data

Proceedings of the 6th Workshop on Data Management for End-To-End Machine Learning, DEEM 2022 - In conjunction with the 2022 ACM SIGMOD/PODS Conference (2022)

DOI: 10.1145/3533028.3533308

6Citations

7Readers

Abstract

We present the first performance evaluation study of model serving integration tools in stream processing frameworks. Using Apache Flink as a representative stream processing system, we evaluate alternative Deep Learning serving pipelines for image classification. Our performance evaluation considers both the case of embedded use of Machine Learning libraries within stream tasks and that of external serving via Remote Procedure Calls. The results indicate superior throughput and scalability for pipelines that make use of embedded libraries to serve pre-trained models. Whereas, latency can vary across strategies, with external serving even achieving lower latency when network conditions are optimal due to better specialized use of underlying hardware. We discuss our findings and provide further motivating arguments towards research in the area of ML-native data streaming engines in the future.

Author supplied keywords

Cite

CITATION STYLE

APA

Horchidan, S., Kritharakis, E., Kalavri, V., & Carbone, P. (2022). Evaluating model serving strategies over streaming data. In Proceedings of the 6th Workshop on Data Management for End-To-End Machine Learning, DEEM 2022 - In conjunction with the 2022 ACM SIGMOD/PODS Conference. Association for Computing Machinery, Inc. https://doi.org/10.1145/3533028.3533308

Evaluating model serving strategies over streaming data

Abstract

Author supplied keywords

Cite

Register to see more suggestions