Klotski: Efficient and Safe Network Migration of Large Production Datacenters

0Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper presents the design, implementation, evaluation, and deployment of Meta's production network migration system. We first introduce the network migration problem for large-scale production datacenter networks (DCNs). A network migration task at Meta touches as many as hundreds of switches and tens of thousands of circuits per datacenter (DC), and involves physical deployment work on site that can last months. We describe real-world migration challenges, covering complex and evolving DCN architectures and operational constraints. We mathematically formalize the problem of generating efficient and safe migration plans, and exploit the inherent symmetry and locality of DCN topologies to prune the search space. We design an ordering-agnostic compact topology representation to eliminate redundant satisfiability checking, and apply the A∗algorithm with a domain-specific priority function to find the optimal plan. Evaluation results on a range of production migration cases show that Klotski reduces the time to find optimal migration plans by up to 381× compared to prior solutions. We hope by introducing the problem and sharing our deployment experience, this work can provide a useful context for network migration in the real world and inspire future research.

Author supplied keywords

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhao, Y., Zhang, X., Zhu, H., Zhang, Y., Wang, Z., Tian, Y., … Jin, X. (2023). Klotski: Efficient and Safe Network Migration of Large Production Datacenters. In SIGCOMM 2023 - Proceedings of the ACM SIGCOMM 2023 Conference (pp. 783–797). Association for Computing Machinery, Inc. https://doi.org/10.1145/3603269.3604818

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Lecturer / Post doc 1

25%

Researcher 1

25%

Readers' Discipline

Tooltip

Computer Science 2

50%

Engineering 2

50%

Save time finding and organizing research with Mendeley

Sign up for free