In this paper we study a fault-tolerant model for Grid environments based on task replication. The basic idea is to produce multiple replicas of a given task and submit them to the Grid, assuming that the failure probability of each replica is known a priori. We introduce a scheme for calculating the number of replicas when the failure probabilities of the individual task replicas differ, and we propose an efficient resource management scheme, based on a fair-share technique, that handles the task replicas so as to maintain fault tolerance in the Grid in a fair way. Our study concludes with simulation results that validate the proposed scheme. © Springer-Verlag Berlin Heidelberg 2005.
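The abstract does not spell out how the number of replicas is calculated. As an illustration only, one common way to reason about it is sketched below, under two assumptions that are not taken from the paper: replica failures are independent, and the task counts as failed only if every replica fails. The function name `replicas_needed` and the parameter `max_task_failure` are hypothetical.

```python
def replicas_needed(failure_probs, max_task_failure):
    """Return the smallest number of replicas (taken in the given order)
    whose joint failure probability is at most max_task_failure.

    Assumptions (not from the paper): failures are independent, and the
    task fails only when all of its replicas fail. Returns None if the
    target cannot be reached with the replicas available.
    """
    if not 0.0 < max_task_failure < 1.0:
        raise ValueError("max_task_failure must lie strictly between 0 and 1")

    all_fail = 1.0  # probability that every replica considered so far fails
    for count, p in enumerate(failure_probs, start=1):
        if not 0.0 <= p <= 1.0:
            raise ValueError("each failure probability must lie in [0, 1]")
        all_fail *= p
        if all_fail <= max_task_failure:
            return count
    return None


if __name__ == "__main__":
    # Three candidate resources with different (hypothetical) failure probabilities.
    print(replicas_needed([0.5, 0.5, 0.5], 0.2))
```

With diverse failure probabilities, the order in which replicas are placed matters in this sketch: trying the most reliable resources first reaches the target with fewer replicas.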
CITATION STYLE
Litke, A., Tserpes, K., Dolkas, K., & Varvarigou, T. (2005). A task replication and fair resource management scheme for fault tolerant grids. In Lecture Notes in Computer Science (Vol. 3470, pp. 1022–1031). Springer-Verlag. https://doi.org/10.1007/11508380_104