To leverage high speed interconnects like InfiniBand it is important to minimize the communication overhead. The most interfering overhead is the registration of communication memory. In this paper, we present our analysis of the memory registration process inside the Mellanox InfiniBand driver and possible ways out of this bottleneck. We evaluate and characterize the most time consuming parts in the execution path of the memory registration function using the Read Time Stamp Counter (RDTSC) instruction. We present measurements on AMD Opteron and Intel Xeon systems with different types of Host Channel Adapters for PCI-X and PCI-Express. Finally, we conclude with first results using Linux hugepage support to shorten the time of registering a memory region. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Mietke, F., Rex, R., Baumgartl, R., Mehlan, T., Hoefler, T., & Rehm, W. (2006). Analysis of the memory registration process in the mellanox InfiniBand software stack. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4128 LNCS, pp. 124–133). Springer Verlag. https://doi.org/10.1007/11823285_13
Mendeley helps you to discover research relevant for your work.