• A Microprogrammed Approach for Implementing Statecharts 

      Cereijo García, Javier; Osorio, Roberto (Institute of Electrical and Electronics Engineers, 2019-10-21)
      [Abstract] Statechart diagrams allow specifying complex systems in which there may be several states active at the same time and a large number of events and transitions to evaluate. Statecharts have been found useful in ...
    • A PVM Based Library for Sparse Matrix Factorizations 

      Touriño, Juan; Doallo, Ramón (Springer, 1998)
      [Abstract] We present 3LM, a C Linked List Management Library for parallel sparse factorizations on a PVM environment which takes into account the fill-in, an important drawback of sparse computations. It is restricted to ...
    • Accelerating the quality control of genetic sequences through stream processing 

      Castellanos Rodríguez, Óscar; Expósito, Roberto R.; Touriño, Juan (Association for Computing Machinery, 2023)
      [Abstract]: Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are based on a batch processing model, needing to have ...
    • Acceleration of a Feature Selection Algorithm Using High Performance Computing 

      Beceiro, Bieito; González-Domínguez, Jorge; Touriño, Juan (MDPI AG, 2020-09-01)
      [Abstract] Feature selection is a subfield of data analysis that is on reducing the dimensionality of datasets, so that subsequent analyses over them can be performed in affordable execution times while keeping the same ...
    • An Inspector-Executor Algorithm for Irregular Assignment Parallelization 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (Springer, 2004)
      [Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is ...
    • Characterization of message-passing overhead on the AP3000 multicomputer 

      Touriño, Juan; Doallo, Ramón (IEEE, 2001-09)
      [Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...
    • Clupiter: a Raspberry Pi mini-supercomputer for educational purposes 

      Rodríguez-Iglesias, Alonso; Martín, María J.; Touriño, Juan (Institute of Electrical and Electronics Engineers, 2024)
      [Abstract]: The main objective of this work is to bring supercomputing and parallel processing closer to non-specialized audiences by building a Raspberry Pi cluster, called Clupiter, which emulates the operation of a ...
    • Communication avoiding and overlapping for numerical linear algebra 

      Georganas, Evangelos; González-Domínguez, Jorge; Solomonik, Edgar; Zheng, Yili; Touriño, Juan; Yelick, Katherine (IEEE Computer Society, 2013-02-25)
      [Abstract] To efficiently scale dense linear algebra problems to future exascale systems, communication cost must be avoided or overlapped. Communication-avoiding 2.5D algorithms improve scalability by reducing inter-processor ...
    • Compiler support for parallel code generation through kernel recognition 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2004-06-07)
      [Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ...
    • Design and Implementation of MapReduce using the PGAS Programming Model with UPC 

      Teijeiro Barjas, Carlos; López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2012-01-03)
      [Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...
    • Efficient Culling Techniques for Interactive Deformable NURBS Surfaces on GPU 

      Concheiro, Raquel; Amor, Margarita; Padrón, Emilio J.; Doggett, Michael (SciTePress, 2016-02)
      [Abstrtact] InfoValue: NURBS (Non-uniform rational B-splines) surfaces are the standard freeform representation in Computer-Aided Design (CAD) applications. Rendering NURBS surfaces accurately while they are interactively ...
    • Efficient Java Communication Protocols on High-speed Cluster Interconnects 

      López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2007-02-26)
      [Abstract] This paper presents communication strategies for achieving efficient parallel and distributed Java applications on clusters with high-speed interconnects. Communication performance is critical for the overall ...
    • Enabling Hardware Affinity in JVM-Based Applications: A Case Study for Big Data 

      Expósito, Roberto R.; Veiga, Jorge; Touriño, Juan (Springer, 2020)
      [Abstract]: Java has been the backbone of Big Data processing for more than a decade due to its interesting features such as object orientation, cross-platform portability and good programming productivity. In fact, most ...
    • Evaluation of Java for General Purpose GPU Computing 

      Docampo, Jorge; Ramos Garea, Sabela; López Taboada, Guillermo; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2013-07-01)
      [Abstract] The presence of many-core units as accelerators has been increasing due to their ability to improve the performance of highly parallel workloads. General Purpose GPU(GPGPU) computing has allowed the graphical ...
    • Evaluation of Parallel Differential Evolution Implementations on MapReduce and Spark 

      Teijeiro, Diego; Pardo, Xoán C.; Penas, David R.; González, Patricia; Banga, Julio R.; Doallo, Ramón (Springer, 2017-09)
      [Abstract] Global optimization problems arise in many areas of science and engineering, computational and systems biology and bioinformatics among them. Many research efforts have focused on developing parallel metaheuristics ...
    • Exploiting locality in the run-time parallelization of irregular loops 

      Martín, María J.; Singh, David E.; Touriño, Juan; Rivera, Francisco F. (C R C Press, LLC, 2002-12-10)
      [Abstract] The goal of this work is the efficient parallel execution of loops with indirect array accesses, in order to be embedded in a parallelizing compiler framework. In this kind of loop pattern, dependences can not ...
    • Extending the Globus Information Service with the Common Information Model 

      Díaz, Iván; Fernández, G.; González, Patricia; Martín, María J.; Touriño, Juan (IEEE Computer Society, 2011-07-14)
      [Abstract] The need of task-adapted and complete information for the management of resources is a well known issue in Grid computing. Globus Toolkit 4 (GT4) includes the Monitoring and Discovery System component (MDS4) to ...
    • Free adaptive tessellation strategy of bézier surfaces 

      Concheiro, Raquel; Amor, Margarita; Bóo, Montserrat; Padrón, Emilio J. (SciTePress, 2014-01)
      [Abstract] Rendering of Bézier surfaces is currently performed by tessellating the model on the GPU and rendering the highly detailed triangle mesh. Whereas non-adaptive strategies apply the same tessellation pattern to ...
    • Guiding the Optimization of Parallel Codes on Multicores Using an Analytical Cache Model 

      Andrade, Diego; Fraguela, Basilio B.; Doallo, Ramón (2018)
      [Abstract]: Cache performance is particularly hard to predict in modern multicore processors as several threads can be concurrently in execution, and private cache levels are combined with shared ones. This paper presents ...
    • Hardware Implementation of Statecharts for FPGA-based Control in Scientific Facilities 

      Cereijo García, Javier; Osorio, Roberto (Institute of Electrical and Electronics Engineers, 2020-01-16)
      [Abstract] The problem of generating complex synchronization patterns using automated tools is addressed in this paper. This work was originally motivated by the need of fast and jitter free synchronization in scientific ...