Portable and efficient FFT and DCT algorithms with the Heterogeneous Butterfly Processing Library
![Thumbnail](/dspace/bitstream/handle/2183/34407/Vazquez_Pardo_Sergio_2019_Portable_and_efficient_FFT_and_DCT_algorithms_with_the_Heterogeneous_Butterfly_Processing_Library.pdf.jpg?sequence=5&isAllowed=y)
Use this link to cite
http://hdl.handle.net/2183/34407
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
Collections
- GI-GAC - Artigos [187]
Metadata
Show full item recordTitle
Portable and efficient FFT and DCT algorithms with the Heterogeneous Butterfly Processing LibraryDate
2019-03Citation
Vázquez, S., Amor, M., & Fraguela, B. B. (2019). Portable and efficient FFT and DCT algorithms with the heterogeneous butterfly processing library. Journal of Parallel and Distributed Computing, 125, 135–146. https://doi.org/10.1016/j.jpdc.2018.11.011
Is version of
https://doi.org/10.1016/j.jpdc.2018.11.011
Abstract
[Abstract]: The existence of a wide variety of computing devices with very different properties makes essential the development of software that is not only portable among them, but which also adapts to the properties of each platform. In this paper, we present the Heterogeneous Butterfly Processing Library (HBPL), which provides optimized portable kernels for problems of small sizes that allow using orthogonal transform algorithms such as the FFT and DCT on different accelerators and regular CPUs. Our library is implemented on the OpenCL standard, which provides portability on a large number of platforms. Furthermore, high performance is achieved on a wide range of devices by exploiting run-time code generation and metaprogramming guided by a parametrization strategy. An exhaustive evaluation on different platforms shows that our proposal obtains competitive or better performance than related libraries.
Keywords
Signal processing
Tuned library
Open computing language (OpenCL)
Heterogeneous platform
GPUs
Tuned library
Open computing language (OpenCL)
Heterogeneous platform
GPUs
Description
Versión final aceptada de: https://doi.org/10.1016/j.jpdc.2018.11.011 This version of the article: Vázquez, S., Amor, M., Fraguela, B. B. (2019). 'Portable and efficient FFT and DCT algorithms with
the heterogeneous butterfly processing library', has been accepted for publication in Journal of Parallel and Distributed
Computing, 125, 135–146. The Version of Record is available online at https://doi.org/10.1016/j.jpdc.2018.11.011.
Editor version
Rights
Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
ISSN
0743-7315