Efficient Scan Operator Methods on a GPU

UDC.coleccionInvestigaciónes_ES
UDC.conferenceTitleSBAC-PAD 2014es_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
dc.contributor.authorPérez Diéguez, Adrián
dc.contributor.authorAmor, Margarita
dc.contributor.authorDoallo, Ramón
dc.date.accessioned2025-01-20T12:53:21Z
dc.date.available2025-01-20T12:53:21Z
dc.date.issued2014
dc.descriptionPresented at: 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, Paris, France, 22-24 October 2014es_ES
dc.descriptionThis version of the article has been accepted for publication, after peer review. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The Version of Record is available online at: https://doi.org/10.1109/SBAC-PAD.2014.23es_ES
dc.description.abstract[Abstract]: Current GPUs (Graphics Processing Units) offer high computational power at relatively low cost, nonetheless, this enhanced performance often comes at the expenses of flexibility and code complexity. Efficient GPU programming requires detailed knowledge on certain hardware aspects. The scan operator is an important building block for a wide range of algorithms. In this paper, we present a number of parallel scan methods based on the traditional cyclic reduction tridiagonal solver and the Ladner-Fischer parallel prefix adder. Futhermore, we analyze a set of new features introduced in the Kepler Nvidia architecture such as read-only data cache and shuffle instructions. Our methods provide an excellent performance in many cases, up to 48% improvement over the CUDA Data Parallel Primitives (CUDPP) library.es_ES
dc.description.sponsorshipThis research has been supported by the Galician Government (Xunta deGalicia) under the Consolidation Program of Competitive Reference Groups, cofunded by FEDER funds of the EU (Ref. GRC2013/055); and by the Ministry of Economy and Competitiviness of Spain and FEDER funds of the EU (Project TIN2013–42148-P)es_ES
dc.description.sponsorshipXunta de Galicia; GRC2013/055es_ES
dc.identifier.citationA. P. Diéguez, M. Amor and R. Doallo, "Efficient Scan Operator Methods on a GPU," 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, Paris, France, 2014, pp. 190-197, doi: 10.1109/SBAC-PAD.2014.23.es_ES
dc.identifier.doi10.1109/SBAC-PAD.2014.23
dc.identifier.issn1550-6533
dc.identifier.urihttp://hdl.handle.net/2183/40781
dc.language.isoenges_ES
dc.publisherIEEE Computer Societyes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2013-42148-P/ES/NUEVOS DESAFIOS EN COMPUTACION DE ALTAS PRESTACIONES: DESDE ARQUITECTURAS HASTA APLICACIONESes_ES
dc.relation.urihttps://doi.org/10.1109/SBAC-PAD.2014.23es_ES
dc.rights© 2014 IEEE.es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectInstruction setses_ES
dc.subjectProposalses_ES
dc.subjectGraphics processing unitses_ES
dc.subjectKerneles_ES
dc.subjectComplexity theoryes_ES
dc.subjectArrayses_ES
dc.subjectRegisterses_ES
dc.titleEfficient Scan Operator Methods on a GPUes_ES
dc.typeconference outputes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication31d7c9d0-70ef-44ef-af1d-e40f560c41bc
relation.isAuthorOfPublicationc98c1fe1-2016-44c1-9225-43fe1c6b8088
relation.isAuthorOfPublicationb3302f65-05d3-4b2c-b8b3-8503e58bba5e
relation.isAuthorOfPublication.latestForDiscovery31d7c9d0-70ef-44ef-af1d-e40f560c41bc

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Amor_Margarita_2014_Efficient_Scan_Operator_Methods_on_a_GPU.pdf
Size:
500.86 KB
Format:
Adobe Portable Document Format
Description:
Versión aceptada