Solving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solvers

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage101es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.issue1es_ES
UDC.journalTitleIEEE Transactions on Computerses_ES
UDC.startPage86es_ES
UDC.volume67es_ES
dc.contributor.authorPérez Diéguez, Adrián
dc.contributor.authorAmor, Margarita
dc.contributor.authorLobeiras Blanco, Jacobo
dc.contributor.authorDoallo, Ramón
dc.date.accessioned2018-08-13T10:28:15Z
dc.date.embargoEndDate2020-02-01es_ES
dc.date.embargoLift2020-02-01
dc.date.issued2018
dc.description.abstract[Abstract] Current Graphics Processing Units (GPUs) are capable of obtaining high computational performance in scientific applications. Nevertheless, programmers have to use suitable parallel algorithms for these architectures and usually have to consider optimization techniques in the implementation in order to achieve said performance. There are many efficient proposals for limited-size problems which fit directly in the shared memory of CUDA GPUs, however, there are few GPU proposals that tackle the design of efficient algorithms for large problem sizes that exceed shared memory storage capacity. In this work, we present a tuning strategy that addresses this problem for some parallel prefix algorithms that can be represented according to a set of common permutations of the digits of each of its element indices [1], denoted as Index-Digit (ID) algorithms. Specifically, our strategy has been applied to develop flexible Multi-Stage (MS) algorithms for the Fast Fourier Transform (FFT) algorithm (MS-ID-FFT) and a tridiagonal system solver (MS-ID-TS) on the GPU. The resulting implementation is compact and outperforms other well-known and commonly used state-of-the-art libraries, with an improvement of up to 1.47x with respect to NVIDIA's complex CUFFT, and up to 33.2x in comparison with NVIDIA's CUSPARSE for real data tridiagonal systems.es_ES
dc.identifier.citationA. P. Diéguez, M. Amor, J. Lobeiras and R. Doallo, "Solving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solvers," in IEEE Transactions on Computers, vol. 67, no. 1, pp. 86-101, 1 Jan. 2018. doi: 10.1109/TC.2017.2723879es_ES
dc.identifier.doi10.1109/TC.2017.2723879
dc.identifier.issn0018-9340
dc.identifier.issn1557-9956
dc.identifier.urihttp://hdl.handle.net/2183/20961
dc.language.isoenges_ES
dc.publisherInstitute of Electrical and Electronics Engineerses_ES
dc.relation.urihttp://dx.doi.org/10.1109/TC.2017.2723879es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectGraphics processing unitses_ES
dc.subjectInstruction setses_ES
dc.subjectKerneles_ES
dc.subjectSignal processing algorithmses_ES
dc.subjectComputer architecturees_ES
dc.subjectSynchronizationes_ES
dc.subjectProposalses_ES
dc.titleSolving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solverses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication31d7c9d0-70ef-44ef-af1d-e40f560c41bc
relation.isAuthorOfPublicationc98c1fe1-2016-44c1-9225-43fe1c6b8088
relation.isAuthorOfPublication0124b851-fdc5-473b-a559-32a1954aafd0
relation.isAuthorOfPublicationb3302f65-05d3-4b2c-b8b3-8503e58bba5e
relation.isAuthorOfPublication.latestForDiscovery31d7c9d0-70ef-44ef-af1d-e40f560c41bc

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A.P.Diéguez_Solving_Large_Problem_Sizes_of_Index-Digit_Algorithms_on_GPU_FFT_2018.pdf
Size:
3.68 MB
Format:
Adobe Portable Document Format
Description: