Effective barrier synchronization on Intel Xeon Phi coprocessor

Andrey Rodchenko; Andy Nisbet; Antoniu Pop; Mikel Luján

Conference ProceedingsOPEN ACCESS

Effective barrier synchronization on Intel Xeon Phi coprocessor

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9233 588-600

DOI: 10.1007/978-3-662-48096-0_45

19Citations

7Readers

Abstract

Barriers are a fundamental synchronization primitive, underpinning the parallel execution models of many modern shared-memory parallel programming languages such as OpenMP, OpenCL or Cilk, and are one of the main challenges to scaling. State-of-the-art barrier synchronization algorithms differ in tradeoffs between critical path length, communication traffic patterns and memory footprint. In this paper, we evaluate the efficiency of five such algorithms on the Intel Xeon Phi coprocessor. In addition, we present a novel hybrid barrier implementation that exploits the topology, the memory hierarchy and streaming stores of the Xeon Phi architecture to achieve a 3× lower overhead than the Intel OpenMP barrier implementation (ICC 14.0.0), thus outperforming, to the best of our knowledge, all other implementations, and which we evaluate on the CG and MG kernels from the NAS Parallel Benchmarks, the direct N-body simulation kernel and the EPCC barrier OpenMP microbenchmark. The optimized barriers presented in the paper are available at https://github.com/arodchen/cbarriers released as free software.

Author supplied keywords

Cite

CITATION STYLE

APA

Rodchenko, A., Nisbet, A., Pop, A., & Luján, M. (2015). Effective barrier synchronization on Intel Xeon Phi coprocessor. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9233, pp. 588–600). Springer Verlag. https://doi.org/10.1007/978-3-662-48096-0_45

Effective barrier synchronization on Intel Xeon Phi coprocessor

Abstract

Author supplied keywords

Cite

Register to see more suggestions