A simple and efficient parallel FFT algorithm using the BSP model

Publication date

2001

Authors

Alves de Inda, M.
Bisseling, Rob h.ISNI 0000000384208994

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

Abstract

We present a new parallel radix-4 FFT algorithm based on the BSP model. Our parallel algorithm uses the group-cyclic distribution family, which makes it simple to understand and easy to implement. We show how to reduce the communication cost of the algorithm by a factor of 3, in the case that the input/output vector is in the cyclic distribution. We also show how to reduce computation time on computers with a cache-based architecture. We present performance results on a Cray T3E with up to 64 processors, obtaining reasonable efficiency levels for local problem sizes as small as 256 and very good efficiency levels for local sizes larger than 2048.

Keywords

Wiskunde en Informatica (WIIN), Mathematics, Wiskunde en computerwetenschappen, Landbouwwetenschappen, Wiskunde: algemeen, Taverne

Citation

Alves de Inda, M & Bisseling, R H 2001, 'A simple and efficient parallel FFT algorithm using the BSP model', Parallel Computing, vol. 27, no. 14, pp. 1847-1878. https://doi.org/10.1016/S0167-8191(01)00118-1