• Soft-fault recovery in MPI applications 

      Fernández Rey, David (2020-09)
      [Abstract] Current high-performance computing (HPC) systems comprise thousands of CPU cores, and this number is expected to grow into the millions in the near future. With such a large number of processors, ...