Performance analysis of OpenMP scheduling type on embarrassingly parallel matrix multiplication algorithm

Abstract

This paper investigates the effect of different OpenMP schedule types on a matrix multiplication algorithm with an embarrassingly parallel loop. OpenMP schedule types and chunk sizes are meant for fine-tuning how parallel loop iterations are distributed among threads. However, the schedule type and chunk size that give the best parallel performance for this particular kind of loop can only be identified by benchmarking. In principle, the static schedule type should be best suited to an embarrassingly parallel loop with equal workload per iteration, because it divides the work equally among threads and should therefore give better load balance and lower overhead. This paper shows that the static schedule type is not necessarily the best candidate. All the schedule types give good load balance, which implies that the compiler is capable of assigning a relatively equal workload to each thread regardless of the explicitly defined schedule type. Benchmarking allows one to make trade-offs in OpenMP directive selection.
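The kind of loop described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' benchmark code: the function name `matmul`, the row-major layout, and the use of `schedule(runtime)` are assumptions made here so that the schedule type can be switched without recompiling.

```c
#include <stddef.h>

/* Hypothetical helper (not from the paper): C = A * B for n x n matrices
   stored in row-major order. Each iteration of the outer loop computes one
   row of C independently, so the loop is embarrassingly parallel and every
   iteration carries the same amount of work.

   schedule(runtime) defers the choice of schedule type and chunk size to the
   OMP_SCHEDULE environment variable. If the compiler is not invoked with
   OpenMP enabled, the pragma is ignored and the loop runs serially. */
void matmul(int n, const double *a, const double *b, double *c)
{
    #pragma omp parallel for schedule(runtime)
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < n; j++) {
            double sum = 0.0;
            for (int k = 0; k < n; k++) {
                sum += a[(size_t)i * n + k] * b[(size_t)k * n + j];
            }
            c[(size_t)i * n + j] = sum;
        }
    }
}
```

With GCC or Clang this would typically be compiled with `-fopenmp` and run with, for example, `OMP_SCHEDULE="static"`, `OMP_SCHEDULE="dynamic,4"`, or `OMP_SCHEDULE="guided"`, timing each call with `omp_get_wtime()` to compare the schedule types on a given machine.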

Qun, N. H., Khalib, Z. I. A., & Raof, R. A. A. (2018). Performance analysis of OpenMP scheduling type on embarrassingly parallel matrix multiplication algorithm. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 5, pp. 917–925). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-319-59427-9_94
