    • BPLG–BMCS: GPU-sorting algorithm using a tuning skeleton library 

      Pérez Diéguez, Adrián; Amor, Margarita; Doallo Biempica, Ramón (Springer New York LLC, 2017)
      [Abstract] In this work, we present an efficient and portable sorting operator for GPUs. Specifically, we propose an algorithmic variant of the bitonic merge sort which reduces the number of processing stages and internal ...
    • Parallel prefix operations on heterogeneous platforms 

      Pérez Diéguez, Adrián (2018)
      [Abstract] Graphics cards, known as GPUs, offer major advantages in computational performance and energy efficiency, making them a key pillar of high-performance computing (HPC). However, this ...
    • Solving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solvers 

      Pérez Diéguez, Adrián; Amor, Margarita; Lobeiras Blanco, Jacobo; Doallo Biempica, Ramón (Institute of Electrical and Electronics Engineers, 2018)
      [Abstract] Current Graphics Processing Units (GPUs) are capable of obtaining high computational performance in scientific applications. Nevertheless, programmers have to use suitable parallel algorithms for these architectures ...