Recently, there have been several experimental and theoretical results showing significant performance benefits of recursive algorithms on both multi-level memory hierarchies and on shared-memory systems. In particular, such algorithms have the data reuse characteristics of a blocked algorithm that is simultaneously blocked at many different levels. Most existing applications, however, are written using ordinary loops. We present a new compiler transformation that can be used to convert loop nests into recursive form automatically. We show that the algorithm is fast and effective, handling loop nests with arbitrary nesting and control flow. The transformation achieves substantial performance improvements for several linear algebra codes even on a current system with a two level cache hierarchy. As a side-effect of this work, we also develop an improved algorithm for transitive dependence analysis (a powerful technique used in the recursion transformation and other loop transformations) that is much faster than the best previously known algorithm in practice. Copyright 2000 ACM.
CITATION STYLE
Yi, Q., Adve, V., & Kennedy, K. (2000). Transforming loops to recursion for multi-level memory hierarchies. SIGPLAN Notices (ACM Special Interest Group on Programming Languages), 35(5), 169–181. https://doi.org/10.1145/358438.349323
Mendeley helps you to discover research relevant for your work.