An automatic optimizer for heterogeneous devices
View/ Open
Use this link to cite
http://hdl.handle.net/2183/34411
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
Collections
- GI-GAC - Artigos [181]
Metadata
Show full item recordTitle
An automatic optimizer for heterogeneous devicesDate
2020-05Citation
Fernández-Fabeiro, J., Andrade, D., Fraguela, B. B., & Doallo, R. (2020a). An automatic optimizer for heterogeneous devices. Future Generation Computer Systems, 106, 572–584. https://doi.org/10.1016/j.future.2020.01.018
Is version of
https://doi.org/10.1016/j.future.2020.01.018
Abstract
[Abstract]: Codes written in a naive way seldom effectively exploit the computing resources, while writing optimized codes is usually a complex task that requires certain levels of expertise. This problem is further increased in the presence of heterogeneous devices, which present more tunable parameters than regular CPUs and high sensitivity to the optimization decisions taken. Furthermore, portability is an added concern given the wide variety of accelerators available. This paper tackles this problem adding an automatic optimizer to a library that already provides an easy and portable way to program heterogeneous devices, the Heterogeneous Programming Library (HPL). Our optimizer takes as input a simple version of a code and then tunes it for the device where it is going to be executed by performing the most usual set of optimizations applicable in heterogeneous devices. These optimizations are parametrized using a set of optimization parameters that need to be tuned for the device. The HPL library has also been equipped with an autotuner that can be used to this purpose. The effectiveness of the autotuner and the optimizer has been tested on several codes and devices. The results show that the combination of the autotuner and the optimizer make the tested codes 16 times faster on average than the original codes written by the programmer.
Keywords
Heterogeneous systems
Performance portability
Performance tuning
OpenCL
Performance portability
Performance tuning
OpenCL
Description
Versión final aceptada de: https://doi.org/10.1016/j.future.2020.01.018 This version of the article: Fernández-Fabeiro, J., Andrade, D., Fraguela, B. B., & Doallo, R. (2020). 'An automaticoptimizer for heterogeneous devices' has been accepted for publication in: Future Generation Computer Systems, 106, 572–584.
The Version of Record is available online at: https://doi.org/10.1016/j.future.2020.01.018 .
Editor version
Rights
Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC-BY-NC-ND 4.0)
ISSN
0167-739X
1872-7115
1872-7115