An overview is given of the lessons learned from the introduction of multi-threading using OpenMP in tmLQCD. In particular, programming style, performance measurements, cache misses, scaling, thread distribution for hybrid codes, race conditions, the overlapping of communication and computation and the measurement and reduction of certain overheads are discussed. Performance measurements and sampling profiles are given for different implementations of the hopping matrix computational kernel.
CITATION STYLE
Deuzeman, A., Jansen, K., Kostrzewa, B., & Urbach, C. (2013). Experiences with OpenMP in tmLQCD. In Proceedings of Science (Vol. 29-July-2013). Proceedings of Science (PoS). https://doi.org/10.22323/1.187.0416
Mendeley helps you to discover research relevant for your work.