Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers

1Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

Data-intensive systems are the backbone of today's computing and are responsible for shaping data centers. Over the years, cloud providers have relied on three principles to maintain cost-effective data systems: use disaggregation to decouple scaling, use domain-specific computing to battle waning laws, and use serverless to lower costs. Although they work well individually, they fail to work in harmony: an issue amplified by emerging data system workloads.In this paper, we envision a distributed runtime to mitigate current shortcomings. The distributed runtime has a tiered access layer exposing declarative APIs, underpinned by a stateful serverless runtime with a distributed task execution model. It will be the narrow waist between data systems and hardware. Users are oblivious to data location, concurrency, disaggregation style, or even the hardware to do the computing. The underlying stateful serverless runtime transparently evolves with novel data-center architectures, such as disaggregation and tightly-coupled clusters. We prototype Skadi to showcase that the distributed runtime is practical.

Cite

CITATION STYLE

APA

Hu, C., Wang, C., Wang, S., Sun, N., Bao, Y., Zhao, J., … Shan, Y. (2023). Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers. In HotOS 2023 - Proceedings of the 19th Workshop on Hot Topics in Operating Systems (pp. 94–102). Association for Computing Machinery, Inc. https://doi.org/10.1145/3593856.3595897

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free