Buscar

Mostrando ítems 1-10 de 17

Resilient MPI applications using an application-level checkpointing framework and ULFM

Losada, Nuria; Cores González, Iván; Martín, María J.; González, Patricia (Springer New York LLC, 2017-01)

[Abstract] Future exascale systems, formed by millions of cores, will present high failure rates, and long-running applications will need to make use of new fault tolerance techniques to ensure successful execution completion. ...

Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study

Rodríguez, Gabriel; Martín, María J.; Touriño, Juan; González, Patricia (Oxford University Press, 2011-11-01)

[Abstract] This paper focuses on the performance evaluation of Compiler for Portable Checkpointing (CPPC), a tool for the checkpointing of parallel message-passing applications. Its performance and the factors that impact ...

ParBiBit: Parallel tool for binary biclustering on modern distributed-memory systems

González-Domínguez, Jorge; Expósito, Roberto R. (PLoS, 2018)

[Abstract]: Biclustering techniques are gaining attention in the analysis of large-scale datasets as they identify two-dimensional submatrices where both rows and columns are correlated. In this work we present ParBiBit, ...

Improving Scalability of Application-Level Checkpoint-Recovery by Reducing Checkpoint Sizes

Cores González, Iván; Rodríguez, Gabriel; Martín, María J.; González, Patricia; Osorio, Roberto (Springer Japan KK, 2013)

[Abstract] The execution times of large-scale parallel applications on nowadays multi/many-core systems are usually longer than the mean time between failures. Therefore, parallel applications must tolerate hardware failures ...

In-memory application-level checkpoint-based migration for MPI programs

Cores González, Iván; Rodríguez, Gabriel; Martín, María J.; González, Patricia (Springer New York LLC, 2014)

[Abstract] Process migration provides many benefits for parallel environments including dynamic load balancing, data access locality or fault tolerance. This paper describes an in-memory application-level checkpoint-based ...

CPPC: a compiler‐assisted tool for portable checkpointing of message‐passing applications

Rodríguez, Gabriel; Martín, María J.; González, Patricia; Touriño, Juan; Doallo, Ramón (John Wiley & Sons Ltd., 2010-11-19)

[Abstract] With the evolution of high‐performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart necessities. Whether due to a failure in the ...

General‐purpose computation on GPUs for high performance cloud computing

Expósito, Roberto R.; Taboada, Guillermo L.; Ramos Garea, Sabela; Touriño, Juan; Doallo, Ramón (John Wiley & Sons Ltd., 2013-08)

[Abstract] Cloud computing is offering new approaches for High Performance Computing (HPC) as it provides dynamically scalable resources as a service over the Internet. In addition, General‐Purpose computation on Graphical ...

Scalable PGAS collective operations in NUMA clusters

Mallón, Damián A.; Teijeiro Barjas, Carlos; González-Domínguez, Jorge; Taboada, Guillermo L.; Gómez, Andrés (Springer New York LLC, 2014-12)

[Abstract] The increasing number of cores per processor is turning manycore-based systems in pervasive. This involves dealing with multiple levels of memory in non uniform memory access (NUMA) systems and processor cores ...

Parallel Brownian dynamics simulations with the message-passing and PGAS programming models

Teijeiro Barjas, Carlos; Sutmann, Godehard; Taboada, Guillermo L.; Touriño, Juan (Elsevier BV, 2013-04)

[Abstract] The simulation of particle dynamics is among the most important mechanisms to study the behavior of molecules in a medium under specific conditions of temperature and density. Several models can be used to compute ...

Performance analysis of HPC applications in the cloud

Expósito, Roberto R.; Taboada, Guillermo L.; Ramos Garea, Sabela; Touriño, Juan; Doallo, Ramón (Elsevier BV * North-Holland, 2013-01)

[Abstract] The scalability of High Performance Computing (HPC) applications depends heavily on the efficient support of network communications in virtualized environments. However, Infrastructure as a Service (IaaS) providers ...