In this paper, methods for efficient utilization of modern accelerator-based hardware for performing high-order finite-element computations are studied. We have implemented several versions of a matrix-free finite-element stiffness operator targeting graphics processors. Two different techniques for handling the issue of conflicting updates are investigated: one approach based on CUDA atomics, and a more advanced approach using mesh coloring. These are contrasted with a number of matrix-free CPU-based implementations. A comparison with standard matrix-based implementations for CPU and GPU is also made. The performance of the different approaches is evaluated through a series of benchmarks corresponding to a Poisson model problem. Depending on dimensionality and polynomial order, the best GPU-based implementations performed between four and ten times faster than the fastest CPU-based implementation.
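The mesh-coloring idea mentioned above can be illustrated with a minimal sketch. This is not the paper's CUDA implementation: it assumes a 1D Poisson problem with linear elements on a uniform mesh, runs serially in Python with NumPy, and all function names (`local_stiffness`, `color_elements`, `apply_matrix_free`) are hypothetical. It only shows why elements of the same color can be processed together without conflicting updates.

```python
import numpy as np

def local_stiffness(h):
    """Element stiffness matrix for a 1D linear element of size h (assumed model)."""
    return (1.0 / h) * np.array([[1.0, -1.0], [-1.0, 1.0]])

def color_elements(elements):
    """Greedy coloring: any two elements that share a node get different colors."""
    node_to_elems = {}
    for e, nodes in enumerate(elements):
        for n in nodes:
            node_to_elems.setdefault(int(n), []).append(e)
    colors = [-1] * len(elements)
    for e, nodes in enumerate(elements):
        # Colors already taken by neighbors (elements sharing a node with e).
        used = {colors[o] for n in nodes for o in node_to_elems[int(n)]
                if colors[o] >= 0}
        c = 0
        while c in used:
            c += 1
        colors[e] = c
    return colors

def apply_matrix_free(elements, h, u, colors):
    """Compute v = A u element by element, without assembling A."""
    v = np.zeros_like(u)
    k = local_stiffness(h)
    for c in range(max(colors) + 1):
        # Elements of one color touch disjoint nodes, so on a GPU this inner
        # loop could run in parallel without atomic additions.
        for e, nodes in enumerate(elements):
            if colors[e] == c:
                v[nodes] += k @ u[nodes]
    return v

# Usage: 8 elements on the unit interval, element e connects nodes e and e+1.
n_elem = 8
h = 1.0 / n_elem
elements = np.array([[i, i + 1] for i in range(n_elem)])
colors = color_elements(elements)
u = np.linspace(0.0, 1.0, n_elem + 1) ** 2
v = apply_matrix_free(elements, h, u, colors)
```

In the atomics-based alternative, all elements would be processed at once and each addition into `v` would be made atomic instead; the coloring approach trades that for one pass per color.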
Ljungkvist, K. (2014). Matrix-free finite-element operator application on graphics processing units. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8806, pp. 450–461). Springer Verlag. https://doi.org/10.1007/978-3-319-14313-2_38