OpenMP is a widely used programming standard for a broad range of parallel systems. In the OpenMP programming model, synchronization points are specified by implicit or explicit barrier operations within a parallel region. However, certain classes of computations, such as stencil algorithms, can be supported with better synchronization efficiency and data locality when using doacross parallelism with point-to-point synchronization than wavefront parallelism with barrier synchronization. In this paper, we propose new synchronization constructs to enable doacross parallelism in the context of the OpenMP programming model. Experimental results on a 32-core IBM Power7 system using four benchmark programs show performance improvements of the proposed doacross approach over OpenMP barriers by factors of 1.4× to 5.2× when using all 32 cores. © 2013 Springer-Verlag.
CITATION STYLE
Shirako, J., Unnikrishnan, P., Chatterjee, S., Li, K., & Sarkar, V. (2013). Expressing DOACROSS loop dependences in OpenMP. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8122 LNCS, pp. 30–44). https://doi.org/10.1007/978-3-642-40698-0_3
Mendeley helps you to discover research relevant for your work.