Efficient algorithms for crash recovery in distributed systems

Abstract

We consider the problem of recovering from processor failures efficiently in distributed systems. Each message received is logged in volatile storage when it is processed. At irregular intervals, each processor independently saves the contents of its volatile storage in stable storage. By appending only O(1) extra information to each message, we show that for recovery in general networks O(n^2) messages are sufficient and in ring networks Θ(n) messages are necessary and sufficient when an arbitrary number of processors fail. By appending O(n) extra information to each message that is sent, we show that O(kn) messages are sufficient for rolling back all of the processors to the maximum consistent states when there are k failures.
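
To make the notion of "maximum consistent states" concrete, the following is a minimal, centralized sketch in Python. It is not the distributed message-passing algorithm from the paper and says nothing about its message complexity. It only illustrates one common consistency condition, assumed here: no processor may have received more messages from a peer than that peer has sent in its own restored state. The names State, max_consistent_cut, and the histories layout are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class State:
    """A recoverable state of one processor (hypothetical layout).

    sent[q] / recv[q] are cumulative counts of messages sent to /
    received from peer q up to this state.
    """
    sent: dict = field(default_factory=dict)
    recv: dict = field(default_factory=dict)


def max_consistent_cut(histories):
    """Return, per processor, the index of the latest state it can keep.

    histories maps a processor name to its recoverable states, oldest
    first. Assumption: index 0 is the initial state with all counts zero,
    and counts never decrease along a history. A failed processor's
    history simply ends at its last state saved in stable storage.
    """
    # Start optimistically from the latest recoverable state everywhere.
    cut = {p: len(h) - 1 for p, h in histories.items()}
    changed = True
    while changed:
        changed = False
        for p, hist in histories.items():
            for q in histories:
                if q == p:
                    continue
                sent_by_q = histories[q][cut[q]].sent.get(p, 0)
                # Roll p back while it remembers receiving messages
                # that q, in its current state, has not sent.
                while cut[p] > 0 and hist[cut[p]].recv.get(q, 0) > sent_by_q:
                    cut[p] -= 1
                    changed = True
    return cut


# Example: A crashed and lost its second send to B, so A's history is
# truncated. B must undo the second receive, which also undoes B's send
# to C, so C must roll back as well.
histories = {
    "A": [State(), State(sent={"B": 1})],
    "B": [State(), State(recv={"A": 1}), State(recv={"A": 2}, sent={"C": 1})],
    "C": [State(), State(recv={"B": 1})],
}
print(max_consistent_cut(histories))  # {'A': 1, 'B': 1, 'C': 0}
```

Because the counts are monotone and each rollback only ever moves a processor to an earlier state, the loop reaches a fixed point, and that fixed point is the latest set of states satisfying the assumed condition. The cascade from A to B to C in the example is the reason a single failure can force several processors to roll back.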

Cite

Juang, T. T. Y., & Venkatesan, S. (1990). Efficient algorithms for crash recovery in distributed systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 472 LNCS, pp. 349–361). Springer Verlag. https://doi.org/10.1007/3-540-53487-3_56
