Abstract
In this paper, we propose a new approach for computing multidimensional DFT's that reduces interprocessor communications and is therefore suitable for efficient implementation on a variety of multiprocessor platforms including MIMD supercomputers and clusters of workstations. Group theoretic concepts are used to formulate a flexible computational strategy that hybrids the reduced transform algorithm (RTA) with the Good-Thomas factorization and can deal efficiently with non-power-of-two sizes without resorting to zero-padding. The RTA algorithm is employed not as a data processing but rather as a bookkeeping tool in order to decompose the problem into many smaller size subproblems (lines) that can be solved independently by the processors. Implementation issues on an Intel iPSC/i860 hypercube are discussed and timing results for large 2-D and 3-D DFTs with index sets in Z/MP x Z/KP and Z/N x Z/MP x Z/KP respectively are provided, where N. M, K are powers-of-two and P is a small prime number such as 3, 5, or 7. The nonoptimized realizations of the new hybrid RTA approach are shown to outperform by as much as 70% the optimized assembly coded realizations of the traditional row-column method on the iPSC/860. © 1995 IEEE
Cite
CITATION STYLE
Kechriotis, G. I., Bletsas, M., & Manolakos, E. S. (1995). A New Approach for Computing Multidimensional DFT’s on Parallel Machines and its Implementation on the iPSC/860 Hypercube. IEEE Transactions on Signal Processing, 43(1), 272–285. https://doi.org/10.1109/78.365307
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.