A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)
Use este enlace para citar
http://hdl.handle.net/2183/34272
A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución-NoComercial-SinDerivadas 3.0 España
Coleccións
- GI-GAC - Artigos [193]
- OpenAIRE [368]
Metadatos
Mostrar o rexistro completo do ítemTítulo
A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)Autor(es)
Data
2019Cita bibliográfica
P. Valero-Lara, D. Andrade, R. Sirvent, J. Labarta, B. B. Fraguela and R. Doallo, "A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)," in IEEE Access, vol. 7, pp. 23365-23378, 2019, doi: 10.1109/ACCESS.2019.2900122.
Resumo
[Abstract]: Many problems of industrial and scientific interest require the solving of tridiagonal linear systems. This paper presents several implementations for the parallel solving of large tridiagonal systems on multi-core architectures, using the OmpSs programming model. The strategy used for the parallelization is based on the combination of two different existing algorithms, PCR and Thomas. The Thomas algorithm, which cannot be parallelized, requires the fewest number of floating point operations. The PCR algorithm is the most popular parallel method, but it is more computationally expensive than Thomas. The method proposed in this paper starts applying the PCR algorithm to break down one large tridiagonal system into a set of smaller and independent ones. In a second step, these independent systems are concurrently solved using Thomas. The paper also contains an analytical study of which is the best point to switch from PCR to Thomas. Also, the paper addresses the main performance issues of combining PCR and Thomas proposing a set of alternative implementations, some of them even imply algorithmic changes. The performance evaluation shows that the best implementation achieves a peak speedup of 4 with respect to the Intel MKL counterpart routine and 2.5 with respect to a single-threaded Thomas.
Palabras chave
Tridiagonal solve
Multi-core
Auto-tuning
OmpSs
Mathematical model
Linear systems
Computational modeling
Memory management
Multi-core
Auto-tuning
OmpSs
Mathematical model
Linear systems
Computational modeling
Memory management
Versión do editor
Dereitos
Atribución-NoComercial-SinDerivadas 3.0 España
ISSN
2169-3536