In modern NUMA architectures, preserving data access locality is a key issue to guarantee performance. We define, for the OpenMP programming model, a type of architecture-agnostic programmer hint to describe the behaviour of parallel loops. These hints are only related to features of the program, in particular to the data accessed by each loop iteration. The runtime will then combine this information with architectural information gathered during its initialization, to guide task scheduling, in case of dynamic loop iteration scheduling. We prove the effectiveness of the proposed technique on the NAS parallel benchmark suite, achieving an average speedup of 1.21x. © 2011 Springer-Verlag.
CITATION STYLE
Di Biagio, A., Speziale, E., & Agosta, G. (2011). Exploiting thread-data affinity in OpenMP with data access patterns. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6852 LNCS, pp. 230–241). https://doi.org/10.1007/978-3-642-23400-2_22
Mendeley helps you to discover research relevant for your work.