We present a study of implementations of DGEMM using both the cache-oblivious and cache-conscious programming styles. The cache-oblivious programs use recursion and automatically block the DGEMM operands A, B, C for the memory hierarchy. The cache-conscious programs use iteration and explicitly block A, B, C for register files, all caches and memory. Our study shows that the cache-oblivious programs achieve substantially lower performance than the cache-conscious programs. We discuss why this is so and suggest approaches for improving the performance of cache-oblivious programs. © Springer-Verlag Berlin Heidelberg 2007.
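The two programming styles contrasted in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `gemm_oblivious` and `gemm_blocked`, the `leaf` and `bs` parameters, and the simplified update C += A·B (without DGEMM's alpha and beta scalars) are all assumptions made for illustration. The recursive version halves the largest dimension until the subproblem is small, so it never names a cache size; the iterative version tiles the loops with an explicit block size, shown here for a single level only, whereas the paper's cache-conscious programs block for registers and every cache level.

```python
def gemm_oblivious(A, B, C, ri, cj, pk, m, n, p, leaf=2):
    """Cache-oblivious style (hypothetical sketch).

    Updates C[ri:ri+m][cj:cj+n] += A[ri:ri+m][pk:pk+p] * B[pk:pk+p][cj:cj+n]
    by recursively halving the largest of the three dimensions.
    """
    if max(m, n, p) <= leaf:
        # Base case: plain triple loop on a small subproblem.
        for i in range(ri, ri + m):
            for j in range(cj, cj + n):
                acc = C[i][j]
                for l in range(pk, pk + p):
                    acc += A[i][l] * B[l][j]
                C[i][j] = acc
        return
    if m >= n and m >= p:
        h = m // 2  # split the rows of A and C
        gemm_oblivious(A, B, C, ri, cj, pk, h, n, p, leaf)
        gemm_oblivious(A, B, C, ri + h, cj, pk, m - h, n, p, leaf)
    elif n >= p:
        h = n // 2  # split the columns of B and C
        gemm_oblivious(A, B, C, ri, cj, pk, m, h, p, leaf)
        gemm_oblivious(A, B, C, ri, cj + h, pk, m, n - h, p, leaf)
    else:
        h = p // 2  # split the shared (inner) dimension
        gemm_oblivious(A, B, C, ri, cj, pk, m, n, h, leaf)
        gemm_oblivious(A, B, C, ri, cj, pk + h, m, n, p - h, leaf)


def gemm_blocked(A, B, C, m, n, p, bs=2):
    """Cache-conscious style (hypothetical sketch, one blocking level).

    Updates C += A * B using explicit bs-by-bs tiles; bs stands in for a
    block size that would be tuned to a specific cache in practice.
    """
    for ii in range(0, m, bs):
        for jj in range(0, n, bs):
            for kk in range(0, p, bs):
                # Multiply one tile of A by one tile of B into a tile of C.
                for i in range(ii, min(ii + bs, m)):
                    for j in range(jj, min(jj + bs, n)):
                        acc = C[i][j]
                        for l in range(kk, min(kk + bs, p)):
                            acc += A[i][l] * B[l][j]
                        C[i][j] = acc
```

For an m-by-p matrix `A` and a p-by-n matrix `B` stored as lists of rows, `gemm_oblivious(A, B, C, 0, 0, 0, m, n, p)` and `gemm_blocked(A, B, C, m, n, p)` should produce the same `C`. The sketch shows only the structural difference between the styles and says nothing about their relative performance, which is the subject of the study.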
CITATION STYLE
Gunnels, J. A., Gustavson, F. G., Pingali, K., & Yotov, K. (2007). Is cache-oblivious DGEMM viable? In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4699 LNCS, pp. 919–928). Springer Verlag. https://doi.org/10.1007/978-3-540-75755-9_109