A Heuristic Approach for the Automatic Insertion of Checkpoints in Message-Passing Codes

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage2911es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.issue14es_ES
UDC.journalTitleJournal of Universal Computer Sciencees_ES
UDC.startPage2894es_ES
UDC.volume15es_ES
dc.contributor.authorRodríguez, Gabriel
dc.contributor.authorMartín, María J.
dc.contributor.authorGonzález, Patricia
dc.contributor.authorTouriño, Juan
dc.date.accessioned2019-02-15T15:26:46Z
dc.date.available2019-02-15T15:26:46Z
dc.date.issued2009-08
dc.description.abstract[Abstract] Checkpointing tools may be typically implemented at two different abstraction levels: at the system level or at the application level. The latter has become a more popular alternative due to its flexibility and the possibility of operating in different environments. However, application-level checkpointing tools often require the user to manually insert checkpoints in order to ensure that certain requirements are met (e.g. forcing checkpoints to be taken at the user code and not inside kernel routines). The approach presented in this work is twofold. First, a spatial coordination protocol for checkpointing parallel SPMD applications is proposed, based on forcing checkpoints to be taken at the same places in the application code by all processes. Thus, global consistency is achieved without adding any new runtime communications or piggybacked data, and without the need to use specific fault-tolerant message-passing implementations. Second, the paper also introduces a compilation technique for the automatic insertion of checkpoints using the spatial coordination protocol, based on a static analysis of communications and a heuristic analysis of computational load. These analyses can also be used to achieve automatic checkpoint insertion in approaches based on classical protocols, such as uncoordinated checkpointing or distributed snapshots.es_ES
dc.description.sponsorshipMinisterio de Ciencia e Innovación; TIN-2007-67537-C03-02es_ES
dc.identifier.citationRODRIGUEZ, Gabriel, et al. A heuristic approach for the automatic insertion of checkpoints in message-passing codes. Journal of Universal Computer Science, 2009, vol. 15, no 14, p. 2894-2911.es_ES
dc.identifier.doi10.3217/jucs-015-14-2894
dc.identifier.issn0948-695X
dc.identifier.issn0948-6968
dc.identifier.urihttp://hdl.handle.net/2183/21807
dc.language.isoenges_ES
dc.publisherTechnische Universitaet Graz * Institut fuer Informationssysteme und Computer Medien,Graz University of Technology, Institute for Information Systems and Computer Mediaes_ES
dc.relation.urihttp://dx.doi.org/10.3217/jucs-015-14-2894es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectCheckpointinges_ES
dc.subjectCompiler supportes_ES
dc.subjectFault tolerancees_ES
dc.subjectMessage passinges_ES
dc.subjectParallel programminges_ES
dc.titleA Heuristic Approach for the Automatic Insertion of Checkpoints in Message-Passing Codeses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublicatione432b4b1-5ead-41aa-b165-d69608b06626
relation.isAuthorOfPublication049797cb-6695-43ea-8f32-efc754fbfda6
relation.isAuthorOfPublication0ed2a744-9046-4c62-8300-a17ef95bea86
relation.isAuthorOfPublication86e306a5-99a1-4c43-8faa-720f0a9f0a34
relation.isAuthorOfPublication.latestForDiscoverye432b4b1-5ead-41aa-b165-d69608b06626

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
G.Rodríguez_2009_A_Heuristic_Approach_for_the_Automatic_Insertion_of_Checkpoints_in_Message-Passing_Codes.pdf
Size:
189.28 KB
Format:
Adobe Portable Document Format
Description: