SPMD OpenMP versus MPI on a IBM SMP for 3 kernels of the NAS benchmarks

Géraud Krawezik; Guillaume Alléon; Franck Cappello

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2327 LNCS 425-436

DOI: 10.1007/3-540-47847-7_39

Abstract

Shared memory multiprocessors are becoming more popular since they are used to build large parallel computers, and the current trend is to increase the number of processors inside each multiprocessor node. However, many existing applications still use the message passing paradigm even when running on shared memory machines. This is due to three main factors: 1) the legacy of previous versions written for distributed memory computers, 2) the difficulty of obtaining high performance with OpenMP when using loop-level parallelization, and 3) the complexity of writing multithreaded programs with a low-level thread library. In this paper we demonstrate that OpenMP can provide better performance than MPI on SMP machines. We use a coarse-grain parallelization approach, also known as the SPMD programming style, with OpenMP. The performance evaluation considers the IBM SP3 NH2 and three kernels of the NAS benchmarks: FT, CG and MG. We compare three implementations of each kernel: the NAS 2.3 MPI version, a fine-grain (loop-level) OpenMP version and our SPMD OpenMP version. A breakdown of the execution times provides an explanation of the performance results. © 2002 Springer Berlin Heidelberg.
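
The distinction the abstract draws between fine-grain (loop-level) OpenMP and the SPMD style can be illustrated with a minimal sketch in C. This is not the authors' code and not taken from the NAS kernels: it uses a simple dot product (an operation of the kind found in CG) as a stand-in, and the function names `dot_loop_level` and `dot_spmd` are hypothetical. In the loop-level version the compiler directive distributes the iterations of one loop; in the SPMD version a single parallel region is opened and each thread computes its own block bounds from its thread number, much as an MPI process would from its rank.

```c
#include <assert.h>
#ifdef _OPENMP
#include <omp.h>
#else
/* Serial fallbacks (assumption of this sketch) so it also builds
   when the compiler is invoked without OpenMP support. */
static int omp_get_thread_num(void) { return 0; }
static int omp_get_num_threads(void) { return 1; }
#endif

/* Fine-grain (loop-level) style: the directive splits the iterations
   of this one loop among threads and reduces the partial sums. */
double dot_loop_level(const double *x, const double *y, int n)
{
    assert(n >= 0);
    double sum = 0.0;
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++)
        sum += x[i] * y[i];
    return sum;
}

/* SPMD style: one parallel region; every thread runs the same code
   and derives its own contiguous block from its thread number. */
double dot_spmd(const double *x, const double *y, int n)
{
    assert(n >= 0);
    double sum = 0.0;
    #pragma omp parallel
    {
        int rank = omp_get_thread_num();   /* plays the role of an MPI rank */
        int size = omp_get_num_threads();

        /* Block decomposition: the first (n % size) threads get one
           extra element so that all n elements are covered exactly once. */
        int chunk = n / size;
        int rem = n % size;
        int lo = rank * chunk + (rank < rem ? rank : rem);
        int hi = lo + chunk + (rank < rem ? 1 : 0);

        double local = 0.0;
        for (int i = lo; i < hi; i++)
            local += x[i] * y[i];

        /* Combine the per-thread partial sums one thread at a time. */
        #pragma omp critical
        sum += local;
    }
    return sum;
}
```

Both functions return the same value for the same input, up to floating-point summation order. In a real SPMD program the parallel region would typically enclose the whole computation rather than a single loop, so that data placement and work distribution are decided once by the programmer; the sketch only shows the indexing pattern.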

Cite

Krawezik, G., Alléon, G., & Cappello, F. (2002). SPMD OpenMP versus MPI on a IBM SMP for 3 kernels of the NAS benchmarks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2327 LNCS, pp. 425–436). Springer Verlag. https://doi.org/10.1007/3-540-47847-7_39
