In multi-Tenant data centers, each tenant desires reassuring predictability from the virtual network fabric-bandwidth guarantee, work conservation, and bounded tail latency. Achieving these goals simultaneously relies on rapid and precise traffic admission. However, the slow convergence (tens of milliseconds) of prior works can hardly satisfy the increasingly rigorous performance demand under dynamic traffic patterns. Further, state-of-The-Art load balance schemes are all guarantee-Agnostic and bring great risks on breaking bandwidth guarantee, which is overlooked in prior works. In this paper, we propose μFab, a predictable virtual fabric solution which can (1) explicitly select proper paths for all flows and (2) converge to ideal bandwidth allocation at sub-millisecond timescales. The core idea of μFab is to leverage the programmable data plane to build a fusion of an active edge (e.g., NIC) and an informative core (e.g., switch), where the core sends link status and tenant information to the edge via telemetry to help the latter make a timely and accurate decision on path selection and traffic admission. We fully implement μFab with commodity SmartNICs and programmable switches. Evaluations show that μFab can keep minimum bandwidth guarantee with high bandwidth utilization and near-optimal transmission latency in various network situations with limited probing bandwidth overhead. Application-level experiments, e.g., compute and storage scenarios, show that μFab can improve QPS by 2.5× and cut tail latency by more than 21× compared to the alternatives.
CITATION STYLE
Wang, S., Gao, K., Qian, K., Li, D., Miao, R., Li, B., … Zhang, M. (2022). Predictable vFabric on informative data plane. In SIGCOMM 2022 - Proceedings of the ACM SIGCOMM 2022 Conference (pp. 615–632). Association for Computing Machinery, Inc. https://doi.org/10.1145/3544216.3544241
Mendeley helps you to discover research relevant for your work.