Block-recursive codes for dense numerical linear algebra computations appear to be well-suited for execution on machines with deep memory hierarchies because they are effectively blocked for all levels of the hierarchy. In this paper, we describe compiler technology to translate iterative versions of a number of numerical kernels into block-recursive form. We also study the cache behavior and performance of these compiler generated block-recursive codes.
CITATION STYLE
Ahmed, N., & Pingali, K. (2000). Automatic generation of block-recursive codes. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1900, pp. 368–378). Springer Verlag. https://doi.org/10.1007/3-540-44520-x_48
Mendeley helps you to discover research relevant for your work.