Towards Local-Failure Local-Recovery in PDE Frameworks: The Case of Linear Solvers

Mirco Altenbernd; Nils Arne Dreier; Christian Engwer; Dominik Göddeke

Conference Proceedings

Towards Local-Failure Local-Recovery in PDE Frameworks: The Case of Linear Solvers

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2021) 12456 LNCS 17-38

DOI: 10.1007/978-3-030-67077-1_2

1Citations

1Readers

Get full text

Abstract

It is expected that with the appearance of exascale supercomputers the mean time between failure in supercomputers will decrease. Classical checkpoint-restart approaches are too expensive at that scale. Local-failure local-recovery (LFLR) strategies are an option that promises to leverage the costs, but actually implementing it into any sufficiently large simulation environment is a challenging task. In this paper we discuss how LFLR methods can be incorporated in a PDE framework, focussing at the linear solvers as the innermost component. We discuss how Krylov solvers can be modified to support LFLR, and present numerical tests. We exemplify our approach by reporting on the implementation of these features in the Dune framework, present C++ software abstractions, which simplify the incorporation of LFLR techniques and show how we use these in our solver library. To reduce the memory costs of full remote backups, we further investigate the benefits of lossy compression and in-memory checkpointing.

Author supplied keywords

Cite

CITATION STYLE

APA

Altenbernd, M., Dreier, N. A., Engwer, C., & Göddeke, D. (2021). Towards Local-Failure Local-Recovery in PDE Frameworks: The Case of Linear Solvers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12456 LNCS, pp. 17–38). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-67077-1_2

Towards Local-Failure Local-Recovery in PDE Frameworks: The Case of Linear Solvers

Abstract

Author supplied keywords

Cite

Register to see more suggestions