This paper addresses the problem of building a failure detection service for large-scale distributed systems and multi-agent systems. It describes the failure detector mechanism and defines the roles it plays in the system. It then presents the key construction problems that are fundamental to building a failure detection service. Finally, it sketches a general framework for implementing such a service. The proposed failure detection service can be used by mobile agents as a crucial component for building fault-tolerant multi-agent systems. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Kobusiński, J. (2007). Failure detection service for large scale systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4496 LNAI, pp. 675–684). Springer Verlag. https://doi.org/10.1007/978-3-540-72830-6_70