An automatic optimizer for heterogeneous devices
Ver/ abrir
Use este enlace para citar
http://hdl.handle.net/2183/34411
A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
Coleccións
- GI-GAC - Artigos [181]
Metadatos
Mostrar o rexistro completo do ítemTítulo
An automatic optimizer for heterogeneous devicesData
2020-05Cita bibliográfica
Fernández-Fabeiro, J., Andrade, D., Fraguela, B. B., & Doallo, R. (2020a). An automatic optimizer for heterogeneous devices. Future Generation Computer Systems, 106, 572–584. https://doi.org/10.1016/j.future.2020.01.018
É version de
https://doi.org/10.1016/j.future.2020.01.018
Resumo
[Abstract]: Codes written in a naive way seldom effectively exploit the computing resources, while writing optimized codes is usually a complex task that requires certain levels of expertise. This problem is further increased in the presence of heterogeneous devices, which present more tunable parameters than regular CPUs and high sensitivity to the optimization decisions taken. Furthermore, portability is an added concern given the wide variety of accelerators available. This paper tackles this problem adding an automatic optimizer to a library that already provides an easy and portable way to program heterogeneous devices, the Heterogeneous Programming Library (HPL). Our optimizer takes as input a simple version of a code and then tunes it for the device where it is going to be executed by performing the most usual set of optimizations applicable in heterogeneous devices. These optimizations are parametrized using a set of optimization parameters that need to be tuned for the device. The HPL library has also been equipped with an autotuner that can be used to this purpose. The effectiveness of the autotuner and the optimizer has been tested on several codes and devices. The results show that the combination of the autotuner and the optimizer make the tested codes 16 times faster on average than the original codes written by the programmer.
Palabras chave
Heterogeneous systems
Performance portability
Performance tuning
OpenCL
Performance portability
Performance tuning
OpenCL
Descrición
Versión final aceptada de: https://doi.org/10.1016/j.future.2020.01.018 This version of the article: Fernández-Fabeiro, J., Andrade, D., Fraguela, B. B., & Doallo, R. (2020). 'An automaticoptimizer for heterogeneous devices' has been accepted for publication in: Future Generation Computer Systems, 106, 572–584.
The Version of Record is available online at: https://doi.org/10.1016/j.future.2020.01.018 .
Versión do editor
Dereitos
Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
ISSN
0167-739X
1872-7115
1872-7115