The OpenMP memory model allows for a temporary view of shared memory that only needs to be made consistent when barrier or flush directives, including implicit ones, are encountered. While this relaxed memory consistency model is key to developing cluster OpenMP implementations, it means that the memory performance of any given implementation is greatly affected by which memory is used, when it is used, and by which threads. In this work we propose a micro-benchmark that can be used to measure memory consistency costs, present results for its application to two contrasting cluster OpenMP implementations, and compare these results with data obtained from a hardware-supported OpenMP environment. © 2008 Springer-Verlag Berlin Heidelberg.
Wong, H. J., Cai, J., Rendell, A. P., & Strazdins, P. (2008). Micro-benchmarks for cluster OpenMP implementations: Memory consistency costs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5004 LNCS, pp. 60–70). Springer Verlag. https://doi.org/10.1007/978-3-540-79561-2_6