Optimizing convolution operators is an important issue as they are used in numerous domains including electromagnetic computations, image processing and nanosimuations. In this paper we present our optimizations for 3D convolutions in the BigDFT nanosimulation software. We focus on processors with vector units and on GPU acceleration and experiment with several architectures. Exploiting the relation between algorithmic specifics and hardware architecture, we obtain performance gains of around x2 on CPU and up to x20 on GPU. © 2013 Springer-Verlag.
CITATION STYLE
Videau, B., Marangozova-Martin, V., Genovese, L., & Deutsch, T. (2013). Optimizing 3D convolutions for wavelet transforms on CPUs with SSE units and GPUs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8097 LNCS, pp. 826–837). https://doi.org/10.1007/978-3-642-40047-6_82
Mendeley helps you to discover research relevant for your work.