This paper gives a technical discussion of the Intel Pentium® Pro processor and optimization strategies used to achieve high performance on scientific applications. We demonstrate these optimizations by characterizing matrix multiplication (DGEMM). We give insight and a model into our efforts on obtaining the world's first TeraFLOP MP LINPACK run (on the Intel ASCI Option Red Supercomputer), based on Pentium Pro processor technology. The importance of this paper is carried by the increasing trend of commodity parts in the supercomputing arena. © 1997 ACM.
CITATION STYLE
Greer, B., & Henry, G. (1997). High performance software on intel pentium pro processors or micro-ops to TeraFLOPS. In Proceedings of the International Conference on Supercomputing. Association for Computing Machinery. https://doi.org/10.1145/509593.509639
Mendeley helps you to discover research relevant for your work.