Listar Grupo de Arquitectura de Computadores (GAC) por título

Characterization of message-passing overhead on the AP3000 multicomputer

Touriño, Juan; Doallo, Ramón (IEEE, 2001-09)

[Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...

Clupiter: a Raspberry Pi mini-supercomputer for educational purposes

Rodríguez-Iglesias, Alonso; Martín, María J.; Touriño, Juan (Institute of Electrical and Electronics Engineers, 2024)

[Abstract]: The main objective of this work is to bring supercomputing and parallel processing closer to non-specialized audiences by building a Raspberry Pi cluster, called Clupiter, which emulates the operation of a ...

Communication avoiding and overlapping for numerical linear algebra

Georganas, Evangelos; González-Domínguez, Jorge; Solomonik, Edgar; Zheng, Yili; Touriño, Juan; Yelick, Katherine (IEEE Computer Society, 2013-02-25)

[Abstract] To efficiently scale dense linear algebra problems to future exascale systems, communication cost must be avoided or overlapped. Communication-avoiding 2.5D algorithms improve scalability by reducing inter-processor ...

Comparison of Hardwired and Microprogrammed Statechart Implementations

Cereijo García, Javier; Osorio, Roberto (MDPI, 2020)

[Abstract]: In scientific facilities such as particle accelerators, fast and jitter-free synchronization is required in order to trigger a large number of actuators at the right time in a variety of situations. The behaviour ...

Compiler support for parallel code generation through kernel recognition

Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2004-06-07)

[Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ...

Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience

Rodríguez, Gabriel; Martín, María J.; González, Patricia; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2013)

[Abstract] With the evolution of high-performance computing, parallel applications have developed an increasing necessity for fault tolerance, most commonly provided by checkpoint and restart techniques. Checkpointing tools ...

Concept Drift Detection and Adaptation for Federated and Continual Learning

Casado, Fernando E.; Lema, Dylan; Criado, Marcos F.; Iglesias, Roberto; Regueiro, Carlos V.; Barro, Senén (Springer, 2021)

[Abstract] Smart devices, such as smartphones, wearables, robots, and others, can collect vast amounts of data from their environment. This data is suitable for training machine learning models, which can significantly ...

CPPC: a compiler‐assisted tool for portable checkpointing of message‐passing applications

Rodríguez, Gabriel; Martín, María J.; González, Patricia; Touriño, Juan; Doallo, Ramón (John Wiley & Sons Ltd., 2010-11-19)

[Abstract] With the evolution of high‐performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart necessities. Whether due to a failure in the ...

CUDA acceleration of MI-based feature selection methods

Beceiro, Bieito; González-Domínguez, Jorge; Morán-Fernández, Laura; Bolón-Canedo, Verónica; Touriño, Juan (Elsevier, 2024-08)

[Abstract]: Feature selection algorithms are necessary nowadays for machine learning as they are capable of removing irrelevant and redundant information to reduce the dimensionality of the data and improve the quality of ...

CUDA-JMI: Acceleration of feature selection on heterogeneous systems

González-Domínguez, Jorge; Expósito, Roberto R.; Bolón-Canedo, Verónica (Elsevier, 2020-01)

[Abstract]: Feature selection is a crucial step nowadays in machine learning and data analytics to remove irrelevant and redundant characteristics and thus to provide fast and reliable analyses. Many research works have ...

Design and Implementation of an extended collectives library for unified Parallel C

Teijeiro Barjas, Carlos; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón; Mouriño, José C.; Mallón, Damián A.; Wibecan, Brian (Springer New York LLC, 2013)

[Abstract] Unified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while it ...

Design and Implementation of MapReduce using the PGAS Programming Model with UPC

Teijeiro Barjas, Carlos; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2012-01-03)

[Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...

Design of efficient Java message-passing collectives on multi-core clusters

Taboada, Guillermo L.; Ramos Garea, Sabela; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2011-02)

[Abstract] This paper presents a scalable and efficient Message-Passing in Java (MPJ) collective communication library for parallel computing on multi-core architectures. The continuous increase in the number of cores per ...

Design of Scalable Java Communication Middleware for Multi-Core Systems

Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (Oxford University Press, 2013-02-01)

[Abstract] This paper presents smdev, a shared memory communication middleware for multi-core systems. smdev provides a simple and powerful messaging application program interface that is able to exploit the underlying ...

Design of scalable Java message-passing communications over InfiniBand

Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2012-07)

[Abstract] This paper presents ibvdev a scalable and efficient low-level Java message-passing communication device over InfiniBand. The continuous increase in the number of cores per processor underscores the need for ...

Developing adaptive multi-device applications with the Heterogeneous Programming Library

Viñas Buceta, Moisés; Bozkus, Zeki; Fraguela, Basilio B.; Andrade, Diego; Doallo, Ramón (Springer, 2015)

[Abstract] The usage of heterogeneous devices presents two main problems. One is their complex programming, a problem that grows when multiple devices are used. The second issue is that even if the codes for these devices ...

Device level communication libraries for high‐performance computing in Java

Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón; Shafi, Aamir; Baker, Mark; Carpenter, Bryan (John Wiley & Sons Ltd., 2011-12-25)

[Abstract] Since its release, the Java programming language has attracted considerable attention from the high‐performance computing (HPC) community because of its portability, high programming productivity, and built‐in ...

Does the choice of nucleotide substitution models matter topologically?

Hoff, Michael; Orf, Stefan; Riehm, Benedikt; Darriba, Diego; Stamatakis, Alexandros (BioMed Central Ltd., 2016-03-24)

[Abstract] Background: In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all ...

Easy Dataflow Programming in Clusters with UPC++ DepSpawn

Fraguela, Basilio B.; Andrade, Diego (Institute of Electrical and Electronics Engineers, 2019-06-01)

[Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...

Efficient Culling Techniques for Interactive Deformable NURBS Surfaces on GPU

Concheiro, Raquel; Amor, Margarita; Padrón, Emilio J.; Doggett, Michael (SciTePress, 2016-02)

[Abstrtact] InfoValue: NURBS (Non-uniform rational B-splines) surfaces are the standard freeform representation in Computer-Aided Design (CAD) applications. Rendering NURBS surfaces accurately while they are interactively ...