• MPI and UPC broadcast, scatter and gather algorithms in Xeon Phi 

      Mallón, Damián A.; Taboada, Guillermo L.; Koesterke, Lars (John Wiley & Sons Ltd., 2016-05-06)
      [Abstract] Accelerators have revolutionised the high performance computing (HPC) community. Despite their advantages, their very specific programming models and limited communication capabilities have kept them in a ...
    • Nonblocking collectives for scalable Java communications 

      Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan (John Wiley & Sons Ltd., 2015-04-22)
      [Abstract] This paper presents a Java implementation of the recently published MPI 3.0 nonblocking message passing collectives in order to analyze and assess the feasibility of taking advantage of these operations in shared ...
    • Scalable PGAS collective operations in NUMA clusters 

      Mallón, Damián A.; Teijeiro Barjas, Carlos; González-Domínguez, Jorge; Taboada, Guillermo L.; Gómez, Andrés (Springer New York LLC, 2014-12)
      [Abstract] The increasing number of cores per processor is turning manycore-based systems in pervasive. This involves dealing with multiple levels of memory in non uniform memory access (NUMA) systems and processor cores ...