With increasing number of processors available on nowadays high performance computing systems, the mean time between failure of these machines is decreasing. The ability of hardware and software components to handle process failures is therefore getting increasingly important. The objective of this paper is to present a fault tolerant approach for the implicit forward time integration of parabolic problems using explicit formulas. This technique allows the application to recover from process failures and to reconstruct the lost data of the failed process(es) avoiding the roll-back operation required in most checkpoint-restart schemes. The benchmark used to highlight the new algorithms is the two dimensional heat equation solved with a first order implicit Euler scheme. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Ltaief, H., Garbey, M., & Gabriel, E. (2006). Parallel fault tolerant algorithms for parabolic problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4128 LNCS, pp. 700–709). Springer Verlag. https://doi.org/10.1007/11823285_73
Mendeley helps you to discover research relevant for your work.