In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional bandwidth and 2.9GB/s of bidirectional bandwidth between 2 MPI tasks running on 1.9GHz Power 4+ IH based nodes. HPS also supports RDMA (remote direct memory access capability). A unique capability of RDMA over HPS is that reliable RDMA is supported over an underlying unreliable transport (unlike Infiniband and other RDMA transport protocols which depend on the underlying transport being reliable). We profile the performance of RDMA and its impact on striping for systems in which multiple network adapters are available to tasks of parallel jobs. © Springer-Verlag 2004.
CITATION STYLE
Govindaraju, R. K., Hochschild, P., Grice, D., Gildea, K., Blackmore, R., Bender, C. A., … Houston, J. (2004). Architecture and early performance of the new IBM HPS fabric and adapter. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3296, 156–165. https://doi.org/10.1007/978-3-540-30474-6_21
Mendeley helps you to discover research relevant for your work.