Expressing DOACROSS loop dependences in OpenMP

Jun Shirako; Priya Unnikrishnan; Sanjay Chatterjee; Kelvin Li; Vivek Sarkar

Conference Proceedings


Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 8122 LNCS, pp. 30–44 (2013)

DOI: 10.1007/978-3-642-40698-0_3

Citations: 6

Readers: 5


Abstract

OpenMP is a widely used programming standard for a broad range of parallel systems. In the OpenMP programming model, synchronization points are specified by implicit or explicit barrier operations within a parallel region. However, certain classes of computations, such as stencil algorithms, can be supported with better synchronization efficiency and data locality when using doacross parallelism with point-to-point synchronization than wavefront parallelism with barrier synchronization. In this paper, we propose new synchronization constructs to enable doacross parallelism in the context of the OpenMP programming model. Experimental results on a 32-core IBM Power7 system using four benchmark programs show performance improvements of the proposed doacross approach over OpenMP barriers by factors of 1.4× to 5.2× when using all 32 cores. © 2013 Springer-Verlag.
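
To illustrate the kind of loop the abstract refers to, here is a minimal sketch of a two-dimensional doacross nest in which each cell depends on its north and west neighbours. It assumes the `ordered(n)` clause with `depend(sink: ...)` and `depend(source)` as standardized in OpenMP 4.5, which may differ from the constructs proposed in the paper. The grid size `N`, the function name `doacross_fill`, and the integer recurrence are hypothetical choices made only for this example.

```c
#include <assert.h>

#define N 5  /* hypothetical grid size, chosen only for illustration */

/* Fill a grid where every interior cell is the sum of its north and west
 * neighbours. The loop-carried dependences (i-1, j) and (i, j-1) are
 * expressed as point-to-point waits instead of a barrier per wavefront.
 * Assumption: OpenMP 4.5 doacross syntax, not the paper's own notation. */
void doacross_fill(long grid[N][N])
{
    /* Boundary row and column are initialized serially. */
    for (int k = 0; k < N; k++) {
        grid[0][k] = 1;
        grid[k][0] = 1;
    }

    #pragma omp parallel
    {
        /* ordered(2) declares that both loops carry cross-iteration dependences. */
        #pragma omp for ordered(2)
        for (int i = 1; i < N; i++) {
            for (int j = 1; j < N; j++) {
                /* Wait until iterations (i-1, j) and (i, j-1) have signalled.
                 * Sinks that fall outside the iteration space are ignored. */
                #pragma omp ordered depend(sink: i - 1, j) depend(sink: i, j - 1)
                grid[i][j] = grid[i - 1][j] + grid[i][j - 1];
                /* Signal that iteration (i, j) is complete. */
                #pragma omp ordered depend(source)
            }
        }
    }

    /* Sanity check: cell (1,1) is the sum of two boundary cells. */
    assert(grid[1][1] == 2);
}
```

Compiled without OpenMP support, the pragmas are ignored and the nest runs serially with the same result, so the sequential semantics are preserved. A barrier-based wavefront version would instead iterate over anti-diagonals and synchronize all threads after each one.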

Citation (APA)

Shirako, J., Unnikrishnan, P., Chatterjee, S., Li, K., & Sarkar, V. (2013). Expressing DOACROSS loop dependences in OpenMP. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8122 LNCS, pp. 30–44). https://doi.org/10.1007/978-3-642-40698-0_3