Abstract
Many recent efforts have demonstrated the performance benefits of running datacenter functions (\emph{e.g.,} NATs, load balancers, monitoring) on programmable switches. However, a key missing piece remains: fault tolerance. This is especially critical as the network is no longer stateless and pure endpoint recovery does not suffice. In this paper, we design and implement RedPlane, a fault-tolerant state store for stateful in-switch applications. This provides in-switch applications consistent access to their state, even if the switch they run on fails or traffic is rerouted to an alternative switch. We address key challenges in devising a practical, provably correct replication protocol and implementing it in the switch data plane. Our evaluations show that RedPlane incurs negligible overhead and enables end-to-end applications to rapidly recover from switch failures.
Author supplied keywords
Cite
CITATION STYLE
Kim, D., Nelson, J., Ports, D. R. K., Sekar, V., & Seshan, S. (2021). RedPlane: Enabling fault-tolerant stateful in-switch applications. In SIGCOMM 2021 - Proceedings of the ACM SIGCOMM 2021 Conference (pp. 223–244). Association for Computing Machinery, Inc. https://doi.org/10.1145/3452296.3472905
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.