• Failure Avoidance in MPI Applications Using an Application-Level Approach 

      Cores González, Iván; Rodríguez, Gabriel; González, Patricia; Martín, María J. (Oxford University Press, 2014)
      [Abstract] Execution times of large-scale computational science and engineering parallel applications are usually longer than the mean-time-between-failures. For this reason, hardware failures must be tolerated by the ...