Mostrar o rexistro simple do ítem
Portable and efficient FFT and DCT algorithms with the Heterogeneous Butterfly Processing Library
dc.contributor.author | Vázquez Pardo, Sergio | |
dc.contributor.author | Amor, Margarita | |
dc.contributor.author | Fraguela, Basilio B. | |
dc.date.accessioned | 2023-12-01T15:07:55Z | |
dc.date.available | 2023-12-01T15:07:55Z | |
dc.date.issued | 2019-03 | |
dc.identifier.citation | Vázquez, S., Amor, M., & Fraguela, B. B. (2019). Portable and efficient FFT and DCT algorithms with the heterogeneous butterfly processing library. Journal of Parallel and Distributed Computing, 125, 135–146. https://doi.org/10.1016/j.jpdc.2018.11.011 | es_ES |
dc.identifier.issn | 0743-7315 | |
dc.identifier.uri | http://hdl.handle.net/2183/34407 | |
dc.description | Versión final aceptada de: https://doi.org/10.1016/j.jpdc.2018.11.011 | es_ES |
dc.description | This version of the article: Vázquez, S., Amor, M., Fraguela, B. B. (2019). 'Portable and efficient FFT and DCT algorithms with the heterogeneous butterfly processing library', has been accepted for publication in Journal of Parallel and Distributed Computing, 125, 135–146. The Version of Record is available online at https://doi.org/10.1016/j.jpdc.2018.11.011. | es_ES |
dc.description.abstract | [Abstract]: The existence of a wide variety of computing devices with very different properties makes essential the development of software that is not only portable among them, but which also adapts to the properties of each platform. In this paper, we present the Heterogeneous Butterfly Processing Library (HBPL), which provides optimized portable kernels for problems of small sizes that allow using orthogonal transform algorithms such as the FFT and DCT on different accelerators and regular CPUs. Our library is implemented on the OpenCL standard, which provides portability on a large number of platforms. Furthermore, high performance is achieved on a wide range of devices by exploiting run-time code generation and metaprogramming guided by a parametrization strategy. An exhaustive evaluation on different platforms shows that our proposal obtains competitive or better performance than related libraries. | es_ES |
dc.description.sponsorship | This research has received financial support from the Ministerio de Economía y Competitividad of Spain and European Regional Development Fund (ERDF) funds (80%) of the EU (TIN2016-75845-P), by the Consellería de Cultura, Educación e Ordenación Universitaria, Xunta de Galicia co-founded by European Regional Development Fund (ERDF) funds under the Consolidation Programme of Competitive Reference Groups (Ref. ED431C 2017/04) and the Consolidation Programme of Competitive Research Units (Ref. R2014/049 and Ref. R2016/037) as well as by the Consellería de Cultura, Educación e Ordenación Universitaria, Xunta de Galicia (Centro Singular de Investigación de Galicia accreditation 2016–2019) and the European Union (European Regional Development Fund, ERDF) under Grant Ref. ED431G/01. | es_ES |
dc.description.sponsorship | Xunta de Galicia; ED431C 2017/04 | es_ES |
dc.description.sponsorship | Xunta de Galicia; ED431G/01 | es_ES |
dc.description.sponsorship | Xunta de Galicia; R2014/049 | es_ES |
dc.description.sponsorship | Xunta de Galicia; R2016/037 | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.relation | info:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-75845-P/ES/NUEVOS DESAFIOS EN COMPUTACION DE ALTAS PRESTACIONES: DESDE ARQUITECTURAS HASTA APLICACIONES (II)/ | es_ES |
dc.relation.isversionof | https://doi.org/10.1016/j.jpdc.2018.11.011 | |
dc.relation.uri | https://doi.org/10.1016/j.jpdc.2018.11.011 | es_ES |
dc.rights | Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0) | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
dc.subject | Signal processing | es_ES |
dc.subject | Tuned library | es_ES |
dc.subject | Open computing language (OpenCL) | es_ES |
dc.subject | Heterogeneous platform | es_ES |
dc.subject | GPUs | es_ES |
dc.title | Portable and efficient FFT and DCT algorithms with the Heterogeneous Butterfly Processing Library | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.access | info:eu-repo/semantics/openAccess | es_ES |
UDC.journalTitle | Journal of Parallel and Distributed Computing | es_ES |
UDC.volume | 125 | es_ES |
UDC.startPage | 135 | es_ES |
UDC.endPage | 146 | es_ES |
dc.identifier.doi | 10.1016/j.jpdc.2018.11.011 |
Ficheiros no ítem
Este ítem aparece na(s) seguinte(s) colección(s)
-
GI-GAC - Artigos [192]