Performance-based parallel loop self-scheduling on heterogeneous multicore PC clusters

Chao Tung Yang; Jen Hsiang Chang; Chao Chin Wu

Conference Proceedings

Performance-based parallel loop self-scheduling on heterogeneous multicore PC clusters

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 5938 LNCS 509-514

DOI: 10.1007/978-3-642-11842-5_71

0Citations

1Readers

Get full text

Abstract

In recent years, Multicore computers have been widely included in cluster systems. They adopt shared memory architectures. However, previous researches on parallel loop self-scheduling did not consider the feature of multicore computers. It is more suitable for shared-memory multiprocessors to adopt OpenMP for parallel programming. In this paper, we propose a performance-based approach that partitions loop iterations according to the performance weighting of cluster nodes. Because the iterations assigned to one MPI process will be processed in parallel by OpenMP threads running by the processor cores in the same computational node, the number of loop iterations to be allocated to one computational node at each scheduling step also depends on the number of processor cores in that node. Experimental results show that the proposed approach performs better than previous schemes. © 2010 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Yang, C. T., Chang, J. H., & Wu, C. C. (2010). Performance-based parallel loop self-scheduling on heterogeneous multicore PC clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5938 LNCS, pp. 509–514). https://doi.org/10.1007/978-3-642-11842-5_71

Performance-based parallel loop self-scheduling on heterogeneous multicore PC clusters

Abstract

Author supplied keywords

Cite

Register to see more suggestions