Evaluating the efficiency of OpenMP tasking for unbalanced computation on diverse CPU architectures

Abstract

In the decade since support for task parallelism was incorporated into OpenMP, its use has remained limited in part due to concerns about its performance and scalability. This paper revisits a study from the early days of OpenMP tasking that used the Unbalanced Tree Search (UTS) benchmark as a stress test to gauge implementation efficiency. The present UTS study includes both Clang/LLVM and vendor OpenMP implementations on four different architectures. We measure parallel efficiency to examine each implementation’s performance in response to varying task granularity. We find that most implementations achieve over 90% efficiency using all available cores for tasks of O(100k) instructions, and the best even manage tasks of O(10k) instructions well.
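As an illustration of the metric named above, here is a minimal sketch of how parallel efficiency is conventionally computed. The abstract does not give the paper's exact definition, so the formula (serial time divided by thread count times parallel time) is assumed here, and the function name and the timings are hypothetical.

```python
def parallel_efficiency(t_serial: float, t_parallel: float, n_threads: int) -> float:
    """Conventional parallel efficiency: speedup divided by thread count.

    Assumption: efficiency = t_serial / (n_threads * t_parallel).
    The paper may define or normalize the metric differently.
    """
    if t_serial <= 0 or t_parallel <= 0 or n_threads < 1:
        raise ValueError("timings must be positive and n_threads at least 1")
    return t_serial / (n_threads * t_parallel)


if __name__ == "__main__":
    # Hypothetical timings in seconds, not measurements from the paper.
    eff = parallel_efficiency(t_serial=100.0, t_parallel=2.0, n_threads=64)
    print(f"efficiency = {eff:.1%}")
```

Under this definition, a value of 1.0 means the run scaled perfectly across all threads, and the 90% threshold in the abstract would correspond to a value of 0.9.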

Cite

Olivier, S. L. (2020). Evaluating the efficiency of OpenMP tasking for unbalanced computation on diverse CPU architectures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12295 LNCS, pp. 18–33). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-58144-2_2