The design and implementation of FFTW3

3.9kCitations
Citations of this article
1.1kReaders
Mendeley users who have this article in their library.
Get full text

Abstract

FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in order to maximize performance. This paper shows that such an approach can yield an implementation that is competitive with hand-optimized libraries, and describes the software structure that makes our current FFTW3 version flexible and adaptive. We further discuss a new algorithm for real-data DFTs of prime size, a new way of implementing DFTs by means of machine-specific single-instruction, multiple-data (SIMD) instructions, and how a special-purpose compiler can derive optimized implementations of the discrete cosine and sine transforms automatically from a DFT algorithm. © 2005 IEEE.

Cite

CITATION STYLE

APA

Frigo, M., & Johnson, S. G. (2005). The design and implementation of FFTW3. In Proceedings of the IEEE (Vol. 93, pp. 216–231). https://doi.org/10.1109/JPROC.2004.840301

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free