Self-healing network for scalable fault tolerant runtime environments

0Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Scalable and fault tolerant runtime environments are needed to support and adapt to the underlying libraries and hardware which require a high degree of scalability in dynamic large-scale environments. This paper presents a self-healing network (SHN) for supporting scalable and fault-tolerant runtime environments. The SHN is designed to support transmission of messages across multiple nodes while also protecting against recursive node and process failures. It will automatically recover itself after a failure occurs. SHN is implemented on top of a scalable fault-tolerant protocol (SFTP). The experimental results show that both the latest multicast and broadcast routing algorithms used in SHN are faster than the original SFTP routing algorithms.

Cite

CITATION STYLE

APA

Angskun, T., Fagg, G. E., Bosilca, G., Pješivac-Grbović, J., & Dongarra, J. J. (2007). Self-healing network for scalable fault tolerant runtime environments. In Distributed and Parallel Systems: From Cluster to Grid Computing (pp. 73–80). Springer US. https://doi.org/10.1007/978-0-387-69858-8_8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free