Title: Parallel prefix operations on GPU: tridiagonal system solvers and scan operators
Authors: Pérez Diéguez, Adrián; Amor, Margarita; Doallo, Ramón
Repository record date: 2025-01-21
Publication year: 2019
Citation: Diéguez, A.P., Amor, M. & Doallo, R. Parallel prefix operations on GPU: tridiagonal system solvers and scan operators. J Supercomput 75, 1510–1523 (2019). https://doi.org/10.1007/s11227-018-2676-z
ISSN: 0920-8542
Handle: http://hdl.handle.net/2183/40804
Version note: This version of the article has been accepted for publication, after peer review (when applicable), and is subject to Springer Nature’s AM terms of use, but it is not the Version of Record and does not reflect post-acceptance improvements or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11227-018-2676-z

[Abstract]: Modern GPUs deliver high computing power at low cost, but exploiting that power still requires considerable time and effort. Tridiagonal system solvers and scan operators are examples of widely used algorithms that can take advantage of these devices. In this article, one tridiagonal system solver and two scan primitive operators are implemented on CUDA GPUs. To do so, a three-phase tuning strategy is developed. Additionally, a performance analysis is carried out on two different CUDA GPU architectures, showing a large improvement over the state of the art.

Language: English
Rights: Copyright © 2018, Springer Science+Business Media, LLC, part of Springer Nature
Keywords: CUDA; GPU; Scan; Tridiagonal systems; Tuning
Type: journal article
Access: open access
DOI: 10.1007/s11227-018-2676-z