Buscar
Mostrando ítems 1-1 de 1
Fault tolerance of MPI applications in exascale systems: The ULFM solution
(Elsevier BV * North-Holland, 2020-05)
[Abstract]
The growth in the number of computational resources used by high-performance computing (HPC) systems leads to an increase in failure rates. Fault-tolerant techniques will become essential for long-running ...