Abstract
Due to the long-run and unpredictable nature of stream processing, any statically configuredexecution of stream jobs fails to process data in a timely and efficient manner. To achieve performance requirements, stream jobs need to be reconfigured dynamically. In this paper, we present Trisk, a control plane that support versatile reconfigurations while keeping high efficiency with easy-to-use programming APIs. Trisk enables versatile reconfigurations with usability based on a task-centric abstraction, and encapsulates primitive operations such that reconfigurations can be described by compositing the primitive operations on the abstraction. Trisk adopts a partial pause-and-resume design for efficiency, through which synchronization mechanisms in the native stream systems can further be leveraged. We implement Trisk on Apache Flink and demonstrate its usage and performance under realistic application scenarios. We show that Trisk executes reconfigurations with shorter completion time and comparable latency compared to a state-of-the-art fluid mechanism for state management.
Cite
CITATION STYLE
Mao, Y., Huang, Y., Tian, R., Wang, X., & Ma, R. T. B. (2021). Trisk: Task-centric data stream reconfiguration. In SoCC 2021 - Proceedings of the 2021 ACM Symposium on Cloud Computing (pp. 214–228). Association for Computing Machinery, Inc. https://doi.org/10.1145/3472883.3487010
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.