Key points are not available for this paper at this time.
FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in order to maximize performance. This paper shows that such an approach can yield an implementation that is competitive with handoptimized libraries, and describes the software structure that makes our current FFTW3 version flexible and adaptive. We further discuss a new algorithm for real-data DFTs of prime size, a new way of implementing DFTs by means of machine-specific “SIMD” instructions, and how a special-purpose compiler can derive optimized implementations of the discrete cosine and sine transforms automatically from a DFT algorithm.
Building similarity graph...
Analyzing shared references across papers
Loading...
Matteo Frigo
Steven G. Johnson
Proceedings of the IEEE
Massachusetts Institute of Technology
IBM (United States)
IBM Research - Austin
Building similarity graph...
Analyzing shared references across papers
Loading...
Frigo et al. (Mon,) studied this question.
www.synapsesocial.com/papers/696f159250a360e9ca1198f5 — DOI: https://doi.org/10.1109/jproc.2004.840301