Protocol-aware recovery for consensus-based distributed storage

2Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

We introduce protocol-aware recovery (Par), a new approach that exploits protocol-specific knowledge to correctly recover from storage faults in distributed systems. We demonstrate the eficacy of Par through the design and implementation of corruption-tolerant replication (Ctrl), a Par mechanism specific to replicated state machine (RSM) systems. We experimentally show that the Ctrl versions of two systems, LogCabin and ZooKeeper, safely recover from storage faults and provide high availability, while the unmodified versions can lose data or become unavailable. We also show that the Ctrl versions achieve this reliability with little performance overheads.

Cite

CITATION STYLE

APA

Alagappan, R., Ganesan, A., Lee, E., Albarghouthi, A., Chidambaram, V., Arpaci-Dusseau, A. C., & Arpaci-Dusseau, R. H. (2018). Protocol-aware recovery for consensus-based distributed storage. ACM Transactions on Storage, 14(3). https://doi.org/10.1145/3241062

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free