When running parallel programs on clusters of individual computers or workstations, network communication is often the performance bottleneck. Since the round-trip time for a network packet is orders of magnitude larger than the amount of time it takes for an equivalent amount of data to be transferred from memory, methods which reduce networkusage can result in significant performance improvements for parallel programs. This workdemonstrates that broadcast performance can be improved by a significant factor using a portable reliable multicasting protocol compared to unicasting, which is typically used. Our end-product is an MPICH patch that does not require kernel modification. It is therefore portable to any UNIX-based system. MPICH is a popular, portable MPI implementation provided by Argonne National Laboratories (ANL). Since absolute reliability is critical for data integrity when broadcasting messages on clusters, our multicasting protocol also addresses reliability issues.
CITATION STYLE
Sack, P., & Elster, A. C. (2002). Fast MPI broadcasts through reliable multicasting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2367, pp. 445–453). Springer Verlag. https://doi.org/10.1007/3-540-48051-x_44
Mendeley helps you to discover research relevant for your work.