Sparsity-Specific Code Optimization using Expression Trees

6Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

We introduce a code generator that converts unoptimized C++ code operating on sparse data into vectorized and parallel CPU or GPU kernels. Our approach unrolls the computation into a massive expression graph, performs redundant expression elimination, grouping, and then generates an architecture-specific kernel to solve the same problem, assuming that the sparsity pattern is fixed, which is a common scenario in many applications in computer graphics and scientific computing. We show that our approach scales to large problems and can achieve speedups of two orders of magnitude on CPUs and three orders of magnitude on GPUs, compared to a set of manually optimized CPU baselines. To demonstrate the practical applicability of our approach, we employ it to optimize popular algorithms with applications to physical simulation and interactive mesh deformation.

Author supplied keywords

Cite

CITATION STYLE

APA

Herholz, P., Tang, X., Schneider, T., Kamil, S., Panozzo, D., & Sorkine-Hornung, O. (2022). Sparsity-Specific Code Optimization using Expression Trees. ACM Transactions on Graphics, 41(5). https://doi.org/10.1145/3520484

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free