Automatic code generation and tuning for stencil kernels on modern shared memory architectures

17Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we present PATUS, a code generation and auto-tuning framework for stencil computations targeted at multi- and manycore processors, such as multicore CPUs and graphics processing units. PATUS, which stands for "Parallel Autotuned Stencils," generates a compute kernel from a specification of the stencil operation and a strategy which describes the parallelization and optimization to be applied, and leverages the autotuning methodology to optimize strategy-specific parameters for the given hardware architecture. © Springer-Verlag 2011.

Cite

CITATION STYLE

APA

Christen, M., Schenk, O., & Burkhart, H. (2011). Automatic code generation and tuning for stencil kernels on modern shared memory architectures. Computer Science - Research and Development, 26(3–4), 205–210. https://doi.org/10.1007/s00450-011-0160-6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free