Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study
View/ Open
Use this link to cite
http://hdl.handle.net/2183/21953Collections
- GI-GAC - Artigos [190]
Metadata
Show full item recordTitle
Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case StudyDate
2011-11-01Citation
Gabriel Rodríguez, María J. Martín, Patricia González, Juan Touriño; Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study, The Computer Journal, Volume 54, Issue 11, 1 November 2011, Pages 1821–1837, https://doi.org/10.1093/comjnl/bxr018
Abstract
[Abstract] This paper focuses on the performance evaluation of Compiler for Portable Checkpointing (CPPC), a tool for the checkpointing of parallel message-passing applications. Its performance and the factors that impact it are transparently and rigorously identified and assessed. The tests were performed on a public supercomputing infrastructure, using a large number of very different applications and showing excellent results in terms of performance and effort required for integration into user codes. Statistical analysis techniques have been used to better approximate the performance of the tool. Quantitative and qualitative comparisons with other rollback-recovery approaches to fault tolerance are also included. All these data and comparisons are then discussed in an effort to extract meaningful conclusions about the state-of-the-art and future research trends in the rollback-recovery field.
Keywords
Fault tolerance
Checkpointing
Parallel programming
Message-passing
MPI
Checkpointing
Parallel programming
Message-passing
MPI
Description
This is a post-peer-review, pre-copyedit version of an article published in The Computer Journal. The final authenticated version is available online at: https://doi.org/10.1093/comjnl/bxr018
Editor version
ISSN
0010-4620
1460-2067
1460-2067