Mostrar o rexistro simple do ítem

dc.contributor.authorCores González, Iván
dc.contributor.authorRodríguez, Gabriel
dc.contributor.authorMartín, María J.
dc.contributor.authorGonzález, Patricia
dc.contributor.authorOsorio, Roberto
dc.date.accessioned2018-08-06T10:18:29Z
dc.date.available2018-08-06T10:18:29Z
dc.date.issued2013
dc.identifier.citationCores, I., Rodríguez, G., martín, M.J. et al. New Gener. Comput. (2013) 31: 163. https://doi.org/10.1007/s00354-013-0302-4es_ES
dc.identifier.issn0288-3635
dc.identifier.issn1882-7055
dc.identifier.urihttp://hdl.handle.net/2183/20945
dc.descriptionThis is a post-peer-review, pre-copyedit version of an article published in New Generation Computing. The final authenticated version is available online at: https://doi.org/10.1007/s00354-013-0302-4es_ES
dc.description.abstract[Abstract] The execution times of large-scale parallel applications on nowadays multi/many-core systems are usually longer than the mean time between failures. Therefore, parallel applications must tolerate hardware failures to ensure that not all computation done is lost on machine failures. Checkpointing and rollback recovery is one of the most popular techniques to implement fault-tolerant applications. However, checkpointing parallel applications is expensive in terms of computing time, network utilization and storage resources. Thus, current checkpoint-recovery techniques should minimize these costs in order to be useful for large scale systems. In this paper three different and complementary techniques to reduce the size of the checkpoints generated by application-level checkpointing are proposed and implemented. Detailed experimental results obtained on a multicore cluster show the effectiveness of the proposed methods to reduce checkpointing cost.es_ES
dc.description.sponsorshipMinisterio de Ciencia e Innovación; TIN2010-16735es_ES
dc.description.sponsorshipGalicia. Consellería de Economía e Industria; 10PXIB105180PRes_ES
dc.language.isoenges_ES
dc.publisherSpringer Japan KKes_ES
dc.relation.urihttps://doi.org/10.1007/s00354-013-0302-4es_ES
dc.subjectParallel programminges_ES
dc.subjectMessage passinges_ES
dc.subjectMPIes_ES
dc.subjectFault tolerancees_ES
dc.subjectCheckpointinges_ES
dc.titleImproving Scalability of Application-Level Checkpoint-Recovery by Reducing Checkpoint Sizeses_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleNew Generation Computinges_ES
UDC.volume31es_ES
UDC.issue3es_ES
UDC.startPage163es_ES
UDC.endPage185es_ES
dc.identifier.doi10.1007/s00354-013-0302-4


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem