In this paper we put forward an annotation system for specifying a sequence of data layout transformations for loop vectorization. We propose four basic primitives for data layout transformations that programmers can compose to achieve complex data layout transformations. Our system automatically modifies all loops and other code operating on the transformed arrays. In addition, we propose data layout aware loop transformations to reduce the overhead of address computation and help vectorization. Taking the Scalar Penta-diagonal (SP) solver, from the NAS Parallel Benchmarks as a case study, we show that the programmer can achieve significant speedups using our annotations. © 2014 IFIP International Federation for Information Processing.
CITATION STYLE
Xu, S., & Gregg, D. (2014). Semi-automatic composition of data layout transformations for loop vectorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8707 LNCS, pp. 485–496). Springer Verlag. https://doi.org/10.1007/978-3-662-44917-2_40
Mendeley helps you to discover research relevant for your work.