Abstract
Communication overhead has been traditionally the primary metric for evaluating rollback-recovery protocols. This paper reexamines the prominence of this metric in light of the recent increases in processor and network speeds. We introduce a new recovery algorithm for a family of rollback-recovery protocols based on logging. The new algorithm incurs a higher communication overhead during recovery than previous algorithms, but it requires less access to stable storage and imposes no restrictions on the execution of live processes. Experimental results show that the new algorithm performs better than one that is optimized for low communication overhead. These results suggest that in modern environments, latency in accessing stable storage and intrusion of a particular algorithm on the execution of live processes are more important than the number of messages exchanged during recovery.
Cite
CITATION STYLE
Elnozahy, E. N. (1995). On the relevance of communication costs of rollback-recovery protocols. In Proceedings of the Annual ACM Symposium on Principles of Distributed Computing (pp. 74–79). ACM. https://doi.org/10.1145/224964.224973
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.