Scalable NIC architecture to support offloading of large scale MPI barrier

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

MPI collective communication overhead dominates the communication cost for large scale parallel computers, scalability and operation latency for collective communication is critical for next generation computers. This paper proposes a fast and scalable barrier communication offload approach which supports millions of compute cores. Following our approach, the barrier operation sequence is packed by host MPI driver into the barrier "descriptor", which is pushed to the NIC (Network-Interfaces). The NIC can complete the barrier automatically following its algorithm descriptor. Our approach leverages an enhanced dissemination algorithm which is suitable for current large scale networks. We show that our approach achieves both barrier performance and scalability, especially for large scale computer system. This paper also proposes an extendable and easy-to-implement NIC architecture supporting barrier offload communication and also other communication pattern. © 2013 Springer-Verlag.

Cite

CITATION STYLE

APA

Wang, S., Xu, W., Wu, D., Pang, Z., & Lu, P. (2013). Scalable NIC architecture to support offloading of large scale MPI barrier. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8299 LNCS, pp. 214–226). https://doi.org/10.1007/978-3-642-45293-2_16

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free