Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study
Use this link to cite
http://hdl.handle.net/2183/21953
Collections
- GI-GAC - Articles [193]
Title
Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study
Date
2011-11-01
Bibliographic citation
Gabriel Rodríguez, María J. Martín, Patricia González, Juan Touriño; Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study, The Computer Journal, Volume 54, Issue 11, 1 November 2011, Pages 1821–1837, https://doi.org/10.1093/comjnl/bxr018
Abstract
This paper focuses on the performance evaluation of Compiler for Portable Checkpointing (CPPC), a tool for the checkpointing of parallel message-passing applications. Its performance and the factors that impact it are transparently and rigorously identified and assessed. The tests were performed on a public supercomputing infrastructure, using a large number of very different applications and showing excellent results in terms of performance and effort required for integration into user codes. Statistical analysis techniques have been used to better approximate the performance of the tool. Quantitative and qualitative comparisons with other rollback-recovery approaches to fault tolerance are also included. All these data and comparisons are then discussed in an effort to extract meaningful conclusions about the state-of-the-art and future research trends in the rollback-recovery field.
Keywords
Fault tolerance
Checkpointing
Parallel programming
Message-passing
MPI
Description
This is a post-peer-review, pre-copyedit version of an article published in The Computer Journal. The final authenticated version is available online at: https://doi.org/10.1093/comjnl/bxr018
ISSN
0010-4620
1460-2067