• Characterization of message-passing overhead on the AP3000 multicomputer 

      Touriño, Juan; Doallo, Ramón (IEEE, 2001-09)
      [Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...
    • Clupiter: a Raspberry Pi mini-supercomputer for educational purposes 

      Rodríguez-Iglesias, Alonso; Martín, María J.; Touriño, Juan (Institute of Electrical and Electronics Engineers, 2024)
      [Abstract]: The main objective of this work is to bring supercomputing and parallel processing closer to non-specialized audiences by building a Raspberry Pi cluster, called Clupiter, which emulates the operation of a ...
    • Communication avoiding and overlapping for numerical linear algebra 

      Georganas, Evangelos; González-Domínguez, Jorge; Solomonik, Edgar; Zheng, Yili; Touriño, Juan; Yelick, Katherine (IEEE Computer Society, 2013-02-25)
      [Abstract] To efficiently scale dense linear algebra problems to future exascale systems, communication cost must be avoided or overlapped. Communication-avoiding 2.5D algorithms improve scalability by reducing inter-processor ...
    • Comparison of Hardwired and Microprogrammed Statechart Implementations 

      Cereijo García, Javier; Osorio, Roberto (MDPI, 2020)
      [Abstract]: In scientific facilities such as particle accelerators, fast and jitter-free synchronization is required in order to trigger a large number of actuators at the right time in a variety of situations. The behaviour ...
    • Compiler support for parallel code generation through kernel recognition 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2004-06-07)
      [Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ...
    • Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience 

      Rodríguez, Gabriel; Martín, María J.; González, Patricia; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2013)
      [Abstract] With the evolution of high-performance computing, parallel applications have developed an increasing necessity for fault tolerance, most commonly provided by checkpoint and restart techniques. Checkpointing tools ...
    • Concept Drift Detection and Adaptation for Federated and Continual Learning 

      Casado, Fernando E.; Lema, Dylan; Criado, Marcos F.; Iglesias, Roberto; Regueiro, Carlos V.; Barro, Senén (Springer, 2021)
      [Abstract] Smart devices, such as smartphones, wearables, robots, and others, can collect vast amounts of data from their environment. This data is suitable for training machine learning models, which can significantly ...
    • CPPC: a compiler‐assisted tool for portable checkpointing of message‐passing applications 

      Rodríguez, Gabriel; Martín, María J.; González, Patricia; Touriño, Juan; Doallo, Ramón (John Wiley & Sons Ltd., 2010-11-19)
      [Abstract] With the evolution of high‐performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart necessities. Whether due to a failure in the ...
    • CUDA-JMI: Acceleration of feature selection on heterogeneous systems 

      González-Domínguez, Jorge; Expósito, Roberto R.; Bolón-Canedo, Verónica (Elsevier, 2020-01)
      [Abstract]: Feature selection is a crucial step nowadays in machine learning and data analytics to remove irrelevant and redundant characteristics and thus to provide fast and reliable analyses. Many research works have ...
    • Design and Implementation of an extended collectives library for Unified Parallel C 

      Teijeiro Barjas, Carlos; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón; Mouriño, José C.; Mallón, Damián A.; Wibecan, Brian (Springer New York LLC, 2013)
      [Abstract] Unified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while it ...
    • Design and Implementation of MapReduce using the PGAS Programming Model with UPC 

      Teijeiro Barjas, Carlos; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2012-01-03)
      [Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...
    • Design of efficient Java message-passing collectives on multi-core clusters 

      Taboada, Guillermo L.; Ramos Garea, Sabela; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2011-02)
      [Abstract] This paper presents a scalable and efficient Message-Passing in Java (MPJ) collective communication library for parallel computing on multi-core architectures. The continuous increase in the number of cores per ...
    • Design of Scalable Java Communication Middleware for Multi-Core Systems 

      Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (Oxford University Press, 2013-02-01)
      [Abstract] This paper presents smdev, a shared memory communication middleware for multi-core systems. smdev provides a simple and powerful messaging application program interface that is able to exploit the underlying ...
    • Design of scalable Java message-passing communications over InfiniBand 

      Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2012-07)
      [Abstract] This paper presents ibvdev, a scalable and efficient low-level Java message-passing communication device over InfiniBand. The continuous increase in the number of cores per processor underscores the need for ...
    • Developing adaptive multi-device applications with the Heterogeneous Programming Library 

      Viñas Buceta, Moisés; Bozkus, Zeki; Fraguela, Basilio B.; Andrade, Diego; Doallo, Ramón (Springer, 2015)
      [Abstract] The usage of heterogeneous devices presents two main problems. One is their complex programming, a problem that grows when multiple devices are used. The second issue is that even if the codes for these devices ...
    • Device level communication libraries for high‐performance computing in Java 

      Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón; Shafi, Aamir; Baker, Mark; Carpenter, Bryan (John Wiley & Sons Ltd., 2011-12-25)
      [Abstract] Since its release, the Java programming language has attracted considerable attention from the high‐performance computing (HPC) community because of its portability, high programming productivity, and built‐in ...
    • Does the choice of nucleotide substitution models matter topologically? 

      Hoff, Michael; Orf, Stefan; Riehm, Benedikt; Darriba, Diego; Stamatakis, Alexandros (BioMed Central Ltd., 2016-03-24)
      [Abstract] Background: In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all ...
    • Easy Dataflow Programming in Clusters with UPC++ DepSpawn 

      Fraguela, Basilio B.; Andrade, Diego (Institute of Electrical and Electronics Engineers, 2019-06-01)
      [Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...
    • Efficient Culling Techniques for Interactive Deformable NURBS Surfaces on GPU 

      Concheiro, Raquel; Amor, Margarita; Padrón, Emilio J.; Doggett, Michael (SciTePress, 2016-02)
      [Abstract] NURBS (Non-uniform rational B-splines) surfaces are the standard freeform representation in Computer-Aided Design (CAD) applications. Rendering NURBS surfaces accurately while they are interactively ...
    • Efficient high-precision integer multiplication on the GPU 

      Pérez Diéguez, Adrián; Amor, Margarita; Doallo, Ramón; Nukada, Akira; Matsuoka, Satoshi (SAGE Journals, 2022-03)
      [Abstract]: The multiplication of large integers, which has many applications in computer science, is an operation that can be expressed as a polynomial multiplication followed by a carry normalization. This work develops ...