Work stealing for multi-core HPC clusters

Kaushik Ravichandran; Sangho Lee; Santosh Pande

Conference ProceedingsOPEN ACCESS

Work stealing for multi-core HPC clusters

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6852 LNCS(PART 1) 205-217

DOI: 10.1007/978-3-642-23400-2_20

25Citations

12Readers

Abstract

Today a significant fraction of HPC clusters are built from multi-core machines connected via a high speed interconnect, hence, they have a mix of shared memory and distributed memory. Work stealing algorithms are currently designed for either a shared memory architecture or for a distributed memory architecture and are extended to work on these multi-core clusters by assuming a single underlying architecture. However, as the number of cores in each node increase, the differences between a shared memory architecture and a distributed memory architecture become more acute. Current work stealing approaches are not suitable for multi-core clusters due to the dichotomy of the underlying architecture. We combine the best aspects of both the current approaches in to a new algorithm. Our algorithm allows for more efficient execution of large-scale HPC applications, such as UTS, on clusters which have large multi-cores. As the number of cores per node increase, which is inevitable given today's processor trends, such an approach is crucial. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Ravichandran, K., Lee, S., & Pande, S. (2011). Work stealing for multi-core HPC clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6852 LNCS, pp. 205–217). https://doi.org/10.1007/978-3-642-23400-2_20

Work stealing for multi-core HPC clusters

Abstract

Author supplied keywords

Cite

Register to see more suggestions