Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers

Cunchen Hu; Chenxi Wang; Sa Wang; Ninghui Sun; Yungang Bao; Jieru Zhao; Sanidhya Kashyap; Pengfei Zuo; Xusheng Chen; Liangliang Xu; Qin Zhang; Hao Feng; Yizhou Shan

Conference ProceedingsOPEN ACCESS

Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers

HotOS 2023 - Proceedings of the 19th Workshop on Hot Topics in Operating Systems (2023) 94-102

DOI: 10.1145/3593856.3595897

1Citations

19Readers

Abstract

Data-intensive systems are the backbone of today's computing and are responsible for shaping data centers. Over the years, cloud providers have relied on three principles to maintain cost-effective data systems: use disaggregation to decouple scaling, use domain-specific computing to battle waning laws, and use serverless to lower costs. Although they work well individually, they fail to work in harmony: an issue amplified by emerging data system workloads.In this paper, we envision a distributed runtime to mitigate current shortcomings. The distributed runtime has a tiered access layer exposing declarative APIs, underpinned by a stateful serverless runtime with a distributed task execution model. It will be the narrow waist between data systems and hardware. Users are oblivious to data location, concurrency, disaggregation style, or even the hardware to do the computing. The underlying stateful serverless runtime transparently evolves with novel data-center architectures, such as disaggregation and tightly-coupled clusters. We prototype Skadi to showcase that the distributed runtime is practical.

Cite

CITATION STYLE

APA

Hu, C., Wang, C., Wang, S., Sun, N., Bao, Y., Zhao, J., … Shan, Y. (2023). Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers. In HotOS 2023 - Proceedings of the 19th Workshop on Hot Topics in Operating Systems (pp. 94–102). Association for Computing Machinery, Inc. https://doi.org/10.1145/3593856.3595897

Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers

Abstract

Cite

Register to see more suggestions