Distributed NVRAM cache - Optimization and evaluation with power of adjacency matrix

Artur Malinowski; Paweł Czarnul

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10244 LNCS 15-26

DOI: 10.1007/978-3-319-59105-6_2

4 Citations

2 Readers

Abstract

In this paper we build on our previously proposed MPI I/O NVRAM distributed cache for high performance computing. The cache uses the NVRAM in each cluster node as an intermediate layer between an application and a file, providing fast read/write operations through wrappers of MPI I/O functions. We propose optimizations of the solution: handling of write requests in a synchronous mode, and additional modes that prevent data preloading from a file and synchronization on file close when the solution is used as a temporary cache only. Furthermore, we evaluate the solution with a real application that computes powers of an adjacency matrix of a graph in parallel. We demonstrate the superiority of our solution over a regular MPI I/O implementation for various powers and numbers of graph nodes. Finally, we show good scalability of the solution for more than 600 processes running on a large HPC cluster.

Citation (APA)

Malinowski, A., & Czarnul, P. (2017). Distributed NVRAM cache - Optimization and evaluation with power of adjacency matrix. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10244 LNCS, pp. 15–26). Springer Verlag. https://doi.org/10.1007/978-3-319-59105-6_2
