Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives

Andión, José M.; Arenaz Silva, Manuel; Bodin, François; Rodríguez, Gabriel; Touriño, Juan

dc.contributor.author	Andión, José M.
dc.contributor.author	Arenaz Silva, Manuel
dc.contributor.author	Bodin, François
dc.contributor.author	Rodríguez, Gabriel
dc.contributor.author	Touriño, Juan
dc.date.accessioned	2018-07-11T17:10:02Z
dc.date.available	2018-07-11T17:10:02Z
dc.date.issued	2016-06
dc.identifier.citation	Andión, J.M., Arenaz, M., Bodin, F. et al. Int J Parallel Prog (2016) 44: 620. https://doi.org/10.1007/s10766-015-0362-9	es_ES
dc.identifier.issn	0885-7458
dc.identifier.issn	1573-7640
dc.identifier.uri	http://hdl.handle.net/2183/20902
dc.description	This is a post-peer-review, pre-copyedit version of an article published in International Journal of Parallel Programming. The final authenticated version is available online at: https://doi.org/10.1007/s10766-015-0362-9	es_ES
dc.description.abstract	[Abstract] The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous computing capacity at low cost. Hence, new programming models have been developed to integrate these accelerators with high-level programming languages, giving place to heterogeneous computing systems. Unfortunately, this heterogeneity is also exposed to the programmer complicating its exploitation. This paper presents a new technique to automatically rewrite sequential programs into a parallel counterpart targeting GPU-based heterogeneous systems. The original source code is analyzed through domain-independent computational kernels, which hide the complexity of the implementation details by presenting a non-statement-based, high-level, hierarchical representation of the application. Next, a locality-aware technique based on standard compiler transformations is applied to the original code through OpenHMPP directives. Two representative case studies from scientific applications have been selected: the three-dimensional discrete convolution and the simple-precision general matrix multiplication. The effectiveness of our technique is corroborated by a performance evaluation on NVIDIA GPUs.	es_ES
dc.description.sponsorship	Ministerio de Economía y Competitividad; TIN2010-16735	es_ES
dc.description.sponsorship	Ministerio de Economía y Competitividad; TIN2013-42148-P	es_ES
dc.description.sponsorship	Galicia, Consellería de Cultura, Educación e Ordenación Universitaria; GRC2013-055	es_ES
dc.description.sponsorship	Ministerio de Educación; AP2008-01012	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Springer New York LLC	es_ES
dc.relation.uri	https://doi.org/10.1007/s10766-015-0362-9	es_ES
dc.subject	Heterogeneous systems	es_ES
dc.subject	GPGPU	es_ES
dc.subject	Locality	es_ES
dc.subject	Automatic parallelization	es_ES
dc.subject	OpenHMPP	es_ES
dc.subject	Domain-independent kernel	es_ES
dc.title	Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.rights.access	info:eu-repo/semantics/openAccess	es_ES
UDC.journalTitle	International Journal of Parallel Programming	es_ES
UDC.volume	44	es_ES
UDC.issue	3	es_ES
UDC.startPage	620	es_ES
UDC.endPage	643	es_ES
dc.identifier.doi	10.1007/s10766-015-0362-9

Ficheiros no ítem

Nome:: J.M.Andión_Locality-Aware_Auto ...
Tamaño:: 535.7Kb
Formato:: PDF

Ver/abrir

Este ítem aparece na(s) seguinte(s) colección(s)

GI-GAC - Artigos [192]

Mostrar o rexistro simple do ítem