Fault tolerant file models for MPI-IO parallel file systems

Abstract

Parallelism in file systems is obtained by using several independent server nodes, each supporting one or more secondary storage devices. This approach increases the performance and scalability of the system, but a fault in a single node can make the whole system fail. To avoid this problem, data must be stored using some kind of redundancy technique, so that it can be recovered in case of failure. Fault tolerance can be provided in I/O systems by using replication or RAID-based schemes. However, most current systems apply the same fault tolerance technique at the disk or file system level. This paper describes how fault tolerance support can be used by MPI applications based on PVFS version 2 [1], a well-known parallel file system for clusters. This support can be applied to other parallel file systems with several benefits: fault tolerance at the file level, flexible definition of new fault tolerance schemes, and dynamic reconfiguration of the fault tolerance policy. © Springer-Verlag Berlin Heidelberg 2007.
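The abstract mentions RAID-based redundancy as one way to survive the loss of a server node. As a rough illustration only, the sketch below stripes a file across several simulated data servers and keeps one XOR parity block per stripe (a RAID-4-style layout), then rebuilds the blocks of a single failed server. It does not reproduce the paper's file models and uses neither MPI-IO nor PVFS2; every function name and parameter here (`xor_blocks`, `stripe_with_parity`, `recover_server`, `read_back`, `n_servers`, `block_size`) is a hypothetical choice made for this example.

```python
from functools import reduce


def xor_blocks(blocks):
    """Return the byte-wise XOR of a list of equal-length blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe_with_parity(data, n_servers, block_size):
    """Distribute data round-robin over n_servers and compute per-stripe parity.

    Returns (servers, parity): servers[i] is the list of blocks held by
    simulated data server i, and parity[r] is the XOR of stripe r.
    The data is zero-padded up to a whole number of stripes.
    """
    stripe_bytes = n_servers * block_size
    padded = data + bytes(-len(data) % stripe_bytes)
    servers = [[] for _ in range(n_servers)]
    parity = []
    for offset in range(0, len(padded), stripe_bytes):
        blocks = [
            padded[offset + i * block_size: offset + (i + 1) * block_size]
            for i in range(n_servers)
        ]
        for server, block in zip(servers, blocks):
            server.append(block)
        parity.append(xor_blocks(blocks))
    return servers, parity


def recover_server(servers, parity, failed):
    """Rebuild every block of one failed server from the survivors and parity."""
    rebuilt = []
    for row, parity_block in enumerate(parity):
        survivors = [srv[row] for i, srv in enumerate(servers) if i != failed]
        rebuilt.append(xor_blocks(survivors + [parity_block]))
    return rebuilt


def read_back(servers, length):
    """Reassemble the original byte string from the striped blocks."""
    stripes = zip(*servers)
    return b"".join(b"".join(stripe) for stripe in stripes)[:length]


if __name__ == "__main__":
    payload = b"parallel file systems need redundancy"
    servers, parity = stripe_with_parity(payload, n_servers=3, block_size=4)
    servers[1] = None  # simulate the loss of one data server
    servers[1] = recover_server(servers, parity, failed=1)
    print(read_back(servers, len(payload)) == payload)
```

A single parity block per stripe tolerates only one failed server at a time. Replication, the other option the abstract names, trades extra storage for simpler recovery, and a per-file policy of the kind the abstract describes would let an application choose between such schemes for each file.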

Cite

Calderón, A., García-Carballeira, F., Isailǎ, F., Keller, R., & Schulz, A. (2007). Fault tolerant file models for MPI-IO parallel file systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4757 LNCS, pp. 153–160). Springer Verlag. https://doi.org/10.1007/978-3-540-75416-9_25
