Efficient Solving of Scan Primitive on Multi-GPU Systems
| UDC.coleccion | Investigación | es_ES |
| UDC.conferenceTitle | IPDPS 2018 | es_ES |
| UDC.departamento | Enxeñaría de Computadores | es_ES |
| UDC.endPage | 803 | es_ES |
| UDC.grupoInv | Grupo de Arquitectura de Computadores (GAC) | es_ES |
| UDC.startPage | 794 | es_ES |
| dc.contributor.author | Pérez Diéguez, Adrián | |
| dc.contributor.author | Amor, Margarita | |
| dc.contributor.author | Doallo, Ramón | |
| dc.contributor.author | Nukada, Akira | |
| dc.contributor.author | Matsuoka, Satoshi | |
| dc.date.accessioned | 2025-01-22T12:17:34Z | |
| dc.date.available | 2025-01-22T12:17:34Z | |
| dc.date.issued | 2018 | |
| dc.description | This version of the article has been accepted for publication, after peer review. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The Version of Record is available online at: https://doi.org/10.1109/IPDPS.2018.00089 | es_ES |
| dc.description | Presented at: 32nd IEEE International Parallel and Distributed Processing Symposium, IPDPS 2018, Vancouver, 21-25 May 2018 | es_ES |
| dc.description.abstract | [Abstract]: GPUs fulfill high computation demands, but it is necessary to develop code carefully, selecting algorithms well suited to the GPU architecture and applying different optimizations. This article presents a GPU-suitable algorithm and a tuning strategy for performing the scan primitive over large problem sizes in CUDA. This tuning strategy defines different performance premises to find the GPU execution parameters that maximize performance. Taking these premises into consideration, we easily develop the kernels using CUDA skeletons to ensure efficiency and portability. Based on this, we describe an optimal proposal analyzed over different multiple GPU environments, the first multiple-GPU batch scan proposal to the best of our knowledge. The resulting implementations outperform other well-known libraries in most cases, such as CUDPP, ModernGPU, Thrust, CUB and LightScan. | es_ES |
| dc.description.sponsorship | This work was cofunded by the Government of Galicia and ERDF funds from the EU, under the Consolidation Programme of Competitive Reference Groups [ED431C 2017/04] and Competitive Research Units [R2014/049 and R2016/037]; by the Ministry of Economy and Competitiveness of Spain and ERDF funds [TIN2016-75845-P]; and by the Ministry of Education of Spain (FPU14/02801). | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2017/04 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; R2014/049 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; R2016/037 | es_ES |
| dc.identifier.citation | A. P. Diéguez, M. Amor, R. Doallo, A. Nukada and S. Matsuoka, "Efficient Solving of Scan Primitive on Multi-GPU Systems," 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vancouver, BC, Canada, 2018, pp. 794-803, doi: 10.1109/IPDPS.2018.00089. | es_ES |
| dc.identifier.doi | 10.1109/IPDPS.2018.00089 | |
| dc.identifier.isbn | 978153864368-6 | |
| dc.identifier.issn | 1530-2075 | |
| dc.identifier.uri | http://hdl.handle.net/2183/40835 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-75845-P/ES/NUEVOS DESAFIOS EN COMPUTACION DE ALTAS PRESTACIONES: DESDE ARQUITECTURAS HASTA APLICACIONES (II) | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/MECD/Programa Estatal de Promoción del Talento y su Empleabilidad/FPU14%2F02801/ES/ | es_ES |
| dc.relation.uri | https://doi.org/10.1109/IPDPS.2018.00089 | es_ES |
| dc.rights | Copyright © 2018, IEEE | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.subject | Graphics processing units | es_ES |
| dc.subject | Instruction sets | es_ES |
| dc.subject | Kernel | es_ES |
| dc.subject | Tuning | es_ES |
| dc.subject | Peer-to-peer computing | es_ES |
| dc.subject | Libraries | es_ES |
| dc.subject | Registers | es_ES |
| dc.subject | CUDA | es_ES |
| dc.subject | MultiGPU | es_ES |
| dc.subject | MPI | es_ES |
| dc.subject | Scan | es_ES |
| dc.subject | Tuning | es_ES |
| dc.title | Efficient Solving of Scan Primitive on Multi-GPU Systems | es_ES |
| dc.type | conference output | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 31d7c9d0-70ef-44ef-af1d-e40f560c41bc | |
| relation.isAuthorOfPublication | c98c1fe1-2016-44c1-9225-43fe1c6b8088 | |
| relation.isAuthorOfPublication | b3302f65-05d3-4b2c-b8b3-8503e58bba5e | |
| relation.isAuthorOfPublication.latestForDiscovery | 31d7c9d0-70ef-44ef-af1d-e40f560c41bc |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Amor_Margarita_2018_Efficient_Solving_of_Scan_Primitive_on_Multi_GPU_Systems.pdf
- Size:
- 1.82 MB
- Format:
- Adobe Portable Document Format
- Description:
- Versión aceptada

