Parallel prefix operations on GPU: tridiagonal system solvers and scan operators

Bibliographic citation

Diéguez, A.P., Amor, M. & Doallo, R. Parallel prefix operations on GPU: tridiagonal system solvers and scan operators. J Supercomput 75, 1510–1523 (2019). https://doi.org/10.1007/s11227-018-2676-z

Type of academic work

Academic degree

Abstract

[Abstract]: Modern GPUs can achieve high computing power at low cost, but still requires much time and effort. Tridiagonal system and scan solvers are one example of widely used algorithms which can take advantage of these devices. In this article, one tridiagonal system solver and two scan primitive operators are implemented on CUDA GPUs. To do so, a tuning strategy based on three phases is developed. Additionally, a performance analysis is performed for two different CUDA GPU architectures, resulting in a huge improvement with respect to the state of the art.

Description

This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11227-018-2676-z

Rights

Copyright © 2018, Springer Science Business Media, LLC, part of Springer Nature