A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage23378es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.journalTitleIEEE Accesses_ES
UDC.startPage23365es_ES
UDC.volume7es_ES
dc.contributor.authorValero-Lara, Pedro
dc.contributor.authorAndrade, Diego
dc.contributor.authorSirvent, Raül
dc.contributor.authorLabarta, Jesús
dc.contributor.authorFraguela, Basilio B.
dc.contributor.authorDoallo, Ramón
dc.date.accessioned2023-11-16T19:43:29Z
dc.date.available2023-11-16T19:43:29Z
dc.date.issued2019
dc.description.abstract[Abstract]: Many problems of industrial and scientific interest require the solving of tridiagonal linear systems. This paper presents several implementations for the parallel solving of large tridiagonal systems on multi-core architectures, using the OmpSs programming model. The strategy used for the parallelization is based on the combination of two different existing algorithms, PCR and Thomas. The Thomas algorithm, which cannot be parallelized, requires the fewest number of floating point operations. The PCR algorithm is the most popular parallel method, but it is more computationally expensive than Thomas. The method proposed in this paper starts applying the PCR algorithm to break down one large tridiagonal system into a set of smaller and independent ones. In a second step, these independent systems are concurrently solved using Thomas. The paper also contains an analytical study of which is the best point to switch from PCR to Thomas. Also, the paper addresses the main performance issues of combining PCR and Thomas proposing a set of alternative implementations, some of them even imply algorithmic changes. The performance evaluation shows that the best implementation achieves a peak speedup of 4 with respect to the Intel MKL counterpart routine and 2.5 with respect to a single-threaded Thomas.es_ES
dc.description.sponsorshipThis work was supported in part by the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreements Human Brain Project SGA1 and Human Brain Project SGA2 under Grant 720270 and Grant 785907, in part by the Spanish Ministry of Economy and Competitiveness under the Project Computación de Altas Prestaciones VII under Grant TIN2015-65316-P, in part by the Departament d’Innovació, Universitats i Empresa de la Generalitat de Catalunya, under project MPEXPAR: Models de Programació i Entorns d’Execució Paralůlels under Grant 2014-SGR-1051, in part by the Juan de la Cierva under Grant IJCI-2017-33511, in part by the Fujitsu under the Barcelona Supercomputing Center-Fujitsu Joint Project: Math Libraries Migration and Optimization, in part by the Ministerio de Economía, Industria y Competitividad of Spain, in part by the Fondo Europeo de Desarrollo Regional Funds of the European Union under Grant TIN2016-75845-P, and in part by the Xunta de Galicia co-founded by the European Regional Development Fund (ERDF) under the Consolidation Programme of Competitive Reference Groups under Grant ED431C 2017/04, and in part by the Centro Singular de Investigación de Galicia accreditatión 2016-2019 under Grant ED431G/01.es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2017/04es_ES
dc.description.sponsorshipXunta de Galicia; ED431G/01es_ES
dc.description.sponsorshipGeneralitat de Catalunya; 2014-SGR-1051es_ES
dc.identifier.citationP. Valero-Lara, D. Andrade, R. Sirvent, J. Labarta, B. B. Fraguela and R. Doallo, "A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)," in IEEE Access, vol. 7, pp. 23365-23378, 2019, doi: 10.1109/ACCESS.2019.2900122.es_ES
dc.identifier.doi10.1109/ACCESS.2019.2900122
dc.identifier.issn2169-3536
dc.identifier.urihttp://hdl.handle.net/2183/34272
dc.language.isoenges_ES
dc.publisherInstitute of Electrical and Electronics Engineerses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-75845-P/ES/NUEVOS DESAFIOS EN COMPUTACION DE ALTAS PRESTACIONES: DESDE ARQUITECTURAS HASTA APLICACIONES (II)/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/ Plan Estatal de Investigación Científica y Técnica y de Innovación/IJCI-2017-33511/ES/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/720270es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/785907es_ES
dc.relation.urihttps://ieeexplore.ieee.org/document/8643931es_ES
dc.rightsAtribución-NoComercial-SinDerivadas 3.0 Españaes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.subjectTridiagonal solvees_ES
dc.subjectMulti-corees_ES
dc.subjectAuto-tuninges_ES
dc.subjectOmpSses_ES
dc.subjectMathematical modeles_ES
dc.subjectLinear systemses_ES
dc.subjectComputational modelinges_ES
dc.subjectMemory managementes_ES
dc.titleA Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)es_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationba3b1a6d-65dd-4366-a7d4-f6c802c5f07a
relation.isAuthorOfPublication7f5bae1c-08f6-4204-b22a-fbe20407a6e4
relation.isAuthorOfPublicationb3302f65-05d3-4b2c-b8b3-8503e58bba5e
relation.isAuthorOfPublication.latestForDiscoveryba3b1a6d-65dd-4366-a7d4-f6c802c5f07a

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Valero_Lara_Pedro_2019_A_Fast_Solver_for_Large_Tridiagonal_Systems_on_Multi-Core_Processors_Lass_Library.pdf
Size:
5.9 MB
Format:
Adobe Portable Document Format
Description: