There has been significant research on collective communication operations, in particular MPI broadcast, on distributed-memory platforms. Most of this work optimizes collective operations for particular architectures by taking into account either their topology or their platform parameters. In this work we propose a simple yet general approach to optimizing legacy MPI broadcast algorithms, which are widely used in MPICH and OpenMPI. Theoretical analysis and experimental results on IBM BlueGene/P and a cluster of the Grid’5000 platform are presented.
CITATION STYLE
Hasanov, K., Lastovetsky, A., & Quintin, J. N. (2014). High-level topology-oblivious optimization of MPI broadcast algorithms on extreme-scale platforms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8806, pp. 412–424). Springer Verlag. https://doi.org/10.1007/978-3-319-14313-2_35