Abstract
In this article, we present RoCC, a robust congestion control approach for RDMA-based datacenter networks. RoCC uses the switch queue size as the input to a PI controller, which computes the fair data rate of the flows in the queue. The PI parameters are self-tuning to guarantee stability, rapid convergence, and fair and near-optimal throughput in a wide range of congestion scenarios. Our simulation and DPDK implementation results show that RoCC reduces the number of PFC frames generated under high load by up to 7× compared to DCQCN. At the same time, RoCC achieves 1.7–4.5× and 1.4–3.9× lower tail latency for long flows, and 2.1–7× and 3.5–8.2× lower tail latency for short flows, compared to DCQCN and HPCC, respectively. We also find that RoCC does not require PFC. The functional components of RoCC can be efficiently implemented in P4 and FPGA-based switch hardware.
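To make the idea of a queue-driven PI controller concrete, the following is a minimal sketch, not the authors' implementation. It uses the standard incremental (velocity) form of a PI controller, with the queue error as input and a per-flow rate as output. The class and function names, the fixed gains `alpha` and `beta`, the target queue size `q_ref`, and the toy fluid model of the queue are all assumptions made for illustration; in particular, the gains here are fixed, whereas the abstract describes self-tuning parameters.

```python
from dataclasses import dataclass


@dataclass
class PIRateController:
    """Hypothetical PI controller mapping queue size to a per-flow rate."""
    q_ref: float     # target queue size (assumed value, arbitrary units)
    alpha: float     # gain on the queue error (assumed, fixed)
    beta: float      # gain on the queue growth (assumed, fixed)
    rate_min: float  # lower bound on the advertised rate
    rate_max: float  # upper bound on the advertised rate (line rate)
    rate: float = 0.0
    q_old: float = 0.0

    def __post_init__(self) -> None:
        # Start at line rate, as if no congestion has been observed yet.
        self.rate = self.rate_max

    def update(self, q_cur: float) -> float:
        # Velocity-form PI: the error term acts as the integral part,
        # the queue-growth term as the proportional part.
        delta = self.alpha * (q_cur - self.q_ref) + self.beta * (q_cur - self.q_old)
        self.rate = min(self.rate_max, max(self.rate_min, self.rate - delta))
        self.q_old = q_cur
        return self.rate


def simulate(n_flows: int, capacity: float, steps: int,
             q_ref: float = 50.0, alpha: float = 0.01, beta: float = 0.1):
    """Toy fluid model: n_flows all send at the advertised rate into one queue."""
    ctrl = PIRateController(q_ref=q_ref, alpha=alpha, beta=beta,
                            rate_min=1.0, rate_max=capacity)
    q = 0.0
    for _ in range(steps):
        rate = ctrl.update(q)
        # Queue grows by arrivals minus service in one interval, never below 0.
        q = max(0.0, q + n_flows * rate - capacity)
    return ctrl.rate, q


if __name__ == "__main__":
    rate, q = simulate(n_flows=4, capacity=100.0, steps=500)
    print(f"rate per flow: {rate:.3f}, queue: {q:.3f}")
```

In this toy model the advertised rate settles near `capacity / n_flows` and the queue settles near `q_ref`. The fixed gains were chosen to be stable for this particular number of flows; stability of this simple loop depends on the flow count, which gives some intuition for why a scheme with self-tuning parameters is attractive.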
Menikkumbura, D., Taheri, P., Vanini, E., Fahmy, S., Eugster, P., & Edsall, T. (2023). Congestion Control for Datacenter Networks: A Control-Theoretic Approach. IEEE Transactions on Parallel and Distributed Systems, 34(5), 1682–1696. https://doi.org/10.1109/TPDS.2023.3259799