Abstract
Supercomputer performance is highly dependent on its interconnection subsystem design. In this paper we study how different architectural approaches for router design impact into system performance when running real parallel applications. A thorough methodology has been employed to quantify this impact. Architectural router decisions have been chosen taking into account the constraints of the underlying VLSI technology. After that, an exhaustive evaluation of the interconnection network under standard synthetic traffic has been carried out. Finally, an execution-driven simulation environment has been used to assess the consequences of several router designs on the performance of the entire machine. We will show that low-level decisions, as the adequate selection of router's arbiter, significantly reduce the execution time of parallel applications. To illustrate the effects of the router architecture on system performance two benchmarks were selected: Radix and MP3D.
Cite
CITATION STYLE
Puente, V., Gregorio, J. A., Izu, C., Beivide, R., & Vallejo, F. (1999). Low-level router design and its impact on supercomputer system performance. Proceedings of the International Conference on Supercomputing, 193–201. https://doi.org/10.1145/305138.305193
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.