In this paper we discuss the interaction of expression templates with OpenCL devices. We show how the expression tree of expression templates can be used to generate problem specific OpenCL kernels. In a second approach we use expression templates to optimize the data transfer between the host and the device which leads to a measurable performance increase in a domain specific language approach. We tested the functionality, correctness and performance for both implementations in a case study for vector and matrix operations. © 2012 Springer-Verlag.
CITATION STYLE
Bawidamann, U., & Nehmeier, M. (2012). Expression templates and OpenCL. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7204 LNCS, pp. 71–80). https://doi.org/10.1007/978-3-642-31500-8_8
Mendeley helps you to discover research relevant for your work.