Iterative collective loop fusion

T. J. Ashby; M. F.P. O'Boyle

Conference ProceedingsOPEN ACCESS

Iterative collective loop fusion

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3923 LNCS 202-216

DOI: 10.1007/11688839_17

1Citations

3Readers

Abstract

Naive code generation from high-level languages that encourage modularity can give rise to large numbers of simple loops for array-based programs. Collective loop fusion and array contraction can be used on such codes to improve temporal locality and performance. The problem is typically formalised using a loop dependence graph (LDG), with solutions denoted by fusion partitions. Much previous work has concentrated on approaches to the abstract formulation. We present our technique called iterative collective loop fusion based on empirically evaluating different transformations, and show how it can provide speedups over existing approaches of up to 1.38. We also give results showing that applying such techniques to high-level languages can provide speedups of up to 2.45 over the original code, and outperforms an equivalent code in Fortran. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Ashby, T. J., & O’Boyle, M. F. P. (2006). Iterative collective loop fusion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3923 LNCS, pp. 202–216). https://doi.org/10.1007/11688839_17

Iterative collective loop fusion

Abstract

Cite

Register to see more suggestions