In this paper, we present a method for overlapping communications on parallel computers for pipelined algorithms. We first introduce a general theoretical model which leads to a generic computation scheme for the optimal packet size. Then, we use the OPIUM 3 library, which provides an easy-to-use and efficient way to compute, in the general case, this optimal packet size, on the column LU factorization; the implementation and performance measures are made on an Intel Paragon.
CITATION STYLE
Desprez, F., Ramet, P., & Roman, J. (1996). Optimal grain size computation for pipelined algorithms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1123, pp. 165–172). Springer Verlag. https://doi.org/10.1007/3-540-61626-8_21
Mendeley helps you to discover research relevant for your work.