Modern HEC systems, such as Blue Gene/P, rely on achieving high-performance by using the parallelism of a massive number of low-frequency/low-power processing cores. This means that the local pre- and post-communication processing required by the MPI stack might not be very fast, owing to the slow processing cores. Similarly, small amounts of serialization within the MPI stack that were acceptable on small/medium systems can be brutal on massively parallel systems. In this paper, we study different non-data-communication overheads within the MPI implementation on the IBM Blue Gene/P system. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Balaji, P., Chan, A., Gropp, W., Thakur, R., & Lusk, E. (2008). Non-data-communication overheads in MPI: Analysis on blue Gene/P. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5205 LNCS, pp. 13–22). https://doi.org/10.1007/978-3-540-87475-1_9
Mendeley helps you to discover research relevant for your work.