CORBA based runtime support for load distribution and fault tolerance

Abstract

Parallel scientific computing in a CORBA-based distributed computing environment requires two services not (yet) included in the CORBA specification: load distribution and fault tolerance. Both are essential for long-running applications with high computational demands, such as computational engineering applications. The proposed approach integrates load distribution into the CORBA naming service, which in turn relies on information provided by the underlying Winner resource management system, developed for typical networked Unix workstation environments. Fault tolerance is supported through error detection and backward recovery: proxy objects manage the checkpointing and restart of services in case of failures. A prototypical implementation of the complete system is presented, and performance results obtained for the parallel optimization of a mathematical benchmark function are discussed. © 2000 Springer-Verlag Berlin Heidelberg.
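
The two mechanisms named in the abstract can be illustrated with a small, language-neutral sketch. The Python below is not the authors' implementation and uses no CORBA or Winner API; every class and method name (`LoadAwareNaming`, `CheckpointingProxy`, `FlakyAccumulator`, `load_of`, `snapshot`) is hypothetical. It only assumes that a naming lookup can consult a per-host load figure, and that a proxy can save service state after each successful call and rebuild the service from that state after a failure.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class LoadAwareNaming:
    """Toy naming service: one name may be bound to replicas on several hosts."""
    # Hypothetical stand-in for a query to a resource management system.
    load_of: Callable[[str], float]
    bindings: Dict[str, List[str]] = field(default_factory=dict)

    def bind(self, name: str, host: str) -> None:
        self.bindings.setdefault(name, []).append(host)

    def resolve(self, name: str) -> str:
        # Load distribution: hand out the replica on the least loaded host.
        return min(self.bindings[name], key=self.load_of)


class CheckpointingProxy:
    """Forwards calls, checkpoints after each success, restarts on failure."""

    def __init__(self, factory, max_restarts=3):
        self.factory = factory            # builds a service from saved state
        self.max_restarts = max_restarts
        self.checkpoint = None            # last known good state
        self.service = factory(None)
        self.restarts = 0

    def call(self, method, *args):
        for _ in range(self.max_restarts + 1):
            try:
                result = getattr(self.service, method)(*args)
                self.checkpoint = self.service.snapshot()
                return result
            except RuntimeError:          # assumed failure signal (error detection)
                # Backward recovery: rebuild the service from the checkpoint.
                self.restarts += 1
                self.service = self.factory(self.checkpoint)
        raise RuntimeError("service failed after restarts")


class FlakyAccumulator:
    """Demo service that crashes once when asked to add 3."""
    failures_left = 1

    def __init__(self, state):
        self.total = state if state is not None else 0

    def add(self, x):
        if x == 3 and FlakyAccumulator.failures_left > 0:
            FlakyAccumulator.failures_left -= 1
            raise RuntimeError("simulated crash")
        self.total += x
        return self.total

    def snapshot(self):
        return self.total
```

In this sketch a client would first call `resolve` to pick a host and then talk to the service only through the proxy, so a restart stays invisible to the caller apart from the added latency.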

Cite

Barth, T., Flender, G., Freisleben, B., Grauer, M., & Thilo, F. (2000). CORBA based runtime support for load distribution and fault tolerance. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1800 LNCS, pp. 1144–1151). Springer Verlag. https://doi.org/10.1007/3-540-45591-4_158
