• A PVM Based Library for Sparse Matrix Factorizations 

      Touriño, Juan; Doallo, Ramón (Springer, 1998)
      [Abstract] We present 3LM, a C Linked List Management Library for parallel sparse factorizations on a PVM environment which takes into account the fill-in, an important drawback of sparse computations. It is restricted to ...
    • An Inspector-Executor Algorithm for Irregular Assignment Parallelization 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (Springer, 2004)
      [Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is ...
    • Characterization of message-passing overhead on the AP3000 multicomputer 

      Touriño, Juan; Doallo, Ramón (IEEE, 2001-09)
      [Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...
    • Compiler support for parallel code generation through kernel recognition 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2004-06-07)
      [Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ...
    • Design and Implementation of MapReduce using the PGAS Programming Model with UPC 

      Teijeiro Barjas, Carlos; López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2012-01-03)
      [Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...
    • Efficient Java Communication Protocols on High-speed Cluster Interconnects 

      López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2007-02-26)
      [Abstract] This paper presents communication strategies for achieving efficient parallel and distributed Java applications on clusters with high-speed interconnects. Communication performance is critical for the overall ...
    • Evaluation of Java for General Purpose GPU Computing 

      Docampo, Jorge; Ramos Garea, Sabela; López Taboada, Guillermo; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2013-07-01)
      [Abstract] The presence of many-core units as accelerators has been increasing due to their ability to improve the performance of highly parallel workloads. General Purpose GPU(GPGPU) computing has allowed the graphical ...
    • Evaluation of Parallel Differential Evolution Implementations on MapReduce and Spark 

      Teijeiro, Diego; Pardo, Xoán C.; Penas, David R.; González, Patricia; Banga, Julio R.; Doallo, Ramón (Springer, 2017-09)
      [Abstract] Global optimization problems arise in many areas of science and engineering, computational and systems biology and bioinformatics among them. Many research efforts have focused on developing parallel metaheuristics ...
    • Guiding the Optimization of Parallel Codes on Multicores Using an Analytical Cache Model 

      Andrade, Diego; Fraguela, Basilio B.; Doallo, Ramón (2018)
      [Abstract]: Cache performance is particularly hard to predict in modern multicore processors as several threads can be concurrently in execution, and private cache levels are combined with shared ones. This paper presents ...
    • HPF-2 Support for Dynamic Sparse Computations 

      Asenjo Plaza, Rafael; Plata, Oscar; Touriño, Juan; Zapata, Emilio L.; Doallo, Ramón (Springer, 1998)
      [Abstract] There is a class of sparse matrix computations, such as direct solvers of systems of linear equations, that change the fill-in (nonzero entries) of the coefficient matrix, and involve row and column operations ...
    • Multimethod Optimization for Reverse Engineering of Complex Biological Networks 

      González, Patricia; Penas, David R.; Pardo, Xoán C.; Banga, Julio R.; Doallo, Ramón (Association for Computing Machinery, 2018-09)
      [Abstract] Optimization problems appears in different areas of science and engineering. This paper considers the general problem of reverse engineering in computational biology by means of mixed-integer nonlinear dynamic ...
    • Non-blocking Java Communications Support on Clusters 

      López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (Springer, 2006)
      [Abstract] This paper presents communication strategies for supporting efficient non-blocking Java communication on clusters. The communication performance is critical for the overall cluster performance. It is possible ...
    • NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java 

      Mallón, Damián A.; López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2009-05-08)
      [Abstract] Java is a valuable and emerging alternative for the development of parallel applications, thanks to the availability of several Java message-passing libraries and its full multithreading support. The combination ...
    • Ontological Con guration Management for Wireless Mesh Routers 

      Díaz, Iván; Popi, Cristian; Festor, Olivier; Touriño, Juan; Doallo, Ramón (Springer, 2009)
      [Abstract] Wireless mesh networks (WMNs) are a category of wireless networks that are self-organized, robust and which offer more flexible client coverage with less equipment requirements than wired networks. In WMNs, mesh ...
    • Parallel Sparse Modified Gram-Schmidt QR Decomposition 

      Doallo, Ramón; Fraguela, Basilio B.; Touriño, Juan; Zapata, Emilio L. (Springer, 1996)
      [Abstract] We present a parallel computational method for the QR decomposition with column pivoting of a sparse matrix by means of Modified Gram-Schmidt orthogonalization. Nonzero elements of the matrix M to be decomposed ...
    • Performance analysis of Java message-passing libraries on fast Ethernet, Myrinet and SCI clusters 

      López Taboada, Guillermo; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2004-01-08)
      [Abstract] The use of Java for parallel programming on clusters according to the message-passing paradigm is an attractive choice. In this case, the overall application performance will largely depend on the performance ...
    • Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures 

      Mallón, Damián A.; López Taboada, Guillermo; Teijeiro Barjas, Carlos; Touriño, Juan; Fraguela, Basilio B.; Gómez, Andrés; Doallo, Ramón; Mouriño, José C. (Springer, 2009)
      [Abstract] The current trend to multicore architectures underscores the need of parallelism. While new languages and alternatives for supporting more efficiently these systems are proposed, MPI faces this new challenge. ...
    • Performance Evaluation of Unified Parallel C Collective Communications 

      López Taboada, Guillermo; Teijeiro Barjas, Carlos; Touriño, Juan; Fraguela, Basilio B.; Doallo, Ramón; Mouriño, José C.; Mallón, Damián A.; Gómez, Andrés (IEEE Computer Society, 2009-07-17)
      [Abstract] Unified Parallel C (UPC) is an extension of ANSI C designed for parallel programming. UPC collective primitives, which are part of the UPC standard, increase programming productivity while reducing the communication ...
    • Program Behavior Characterization Through Advanced Kernel Recognition 

      Arenaz Silva, Manuel; Touriño, Juan; Doallo, Ramón (Springer, 2007)
      [Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program ...
    • Sparse Givens QR Factorization on a Multiprocessor 

      Touriño, Juan; Doallo, Ramón; Zapata, E.L. (IEEE Computer Society, 1996-06-01)
      [Abstract] We present a parallel algorithm for the QR factorization with column pivoting of a sparse matrix by means of Givens rotations. Nonzero elements of the matrix M to be decomposed are stored in a one dimensional ...