    • MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud

      Expósito, Roberto R.; Veiga, Jorge; González-Domínguez, Jorge; Touriño, Juan (Oxford University Press, 2017)
      [Abstract] This article presents MarDRe, a de novo cloud-ready duplicate and near-duplicate removal tool that can process single- and paired-end reads from FASTQ/FASTA datasets. MarDRe takes advantage of the widely adopted ...
    • Mobile Robot Positioning with 433-MHz Wireless Motes with Varying Transmission Powers and a Particle Filter 

      Canedo-Rodríguez, Adrián; Rodríguez, José Manuel; Álvarez-Santos, Víctor; Iglesias, Roberto; Regueiro, Carlos V. (Multidisciplinary Digital Publishing Institute, 2015)
      In wireless positioning systems, the transmitter’s power is usually fixed. In this paper, we explore the use of varying transmission powers to increase the performance of a wireless localization system. To this end, we ...
    • ModelTest-NG: A New and Scalable Tool for the Selection of DNA and Protein Evolutionary Models 

      Darriba, Diego; Posada, David; Kozlov, Alexey M.; Stamatakis, Alexandros; Morel, Benoit; Flouri, Tomas (Oxford University Press, 2019-08-21)
      [Abstract] ModelTest-NG is a reimplementation from scratch of jModelTest and ProtTest, two popular tools for selecting the best-fit nucleotide and amino acid substitution models, respectively. ModelTest-NG is one to two ...
    • MPI and UPC broadcast, scatter and gather algorithms in Xeon Phi 

      Mallón, Damián A.; Taboada, Guillermo L.; Koesterke, Lars (John Wiley & Sons Ltd., 2016-05-06)
      [Abstract] Accelerators have revolutionised the high performance computing (HPC) community. Despite their advantages, their very specific programming models and limited communication capabilities have kept them in a ...
    • MPI-dot2dot: A Parallel Tool to Find DNA Tandem Repeats on Multicore Clusters 

      González-Domínguez, Jorge; Martín Martínez, José Manuel; Expósito, Roberto R. (Springer, 2022)
      [Abstract] Tandem Repeats (TRs) are segments that occur several times in a DNA sequence, with each copy adjacent to another. In the last few years, TRs have gained significant attention as they are thought to be related ...
    • MREv: An Automatic MapReduce Evaluation Tool for Big Data Workloads 

      Veiga, Jorge; Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan (Elsevier, 2015)
      [Abstract] The popularity of Big Data computing models like MapReduce has caused the emergence of many frameworks oriented to High Performance Computing (HPC) systems. The suitability of each one to a particular use case ...
    • MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems 

      González-Domínguez, Jorge; Liu, Yongchao; Touriño, Juan; Schmidt, Bertil (Oxford University Press, 2016)
      [Abstract] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-scale input ...
    • Multimethod Optimization for Reverse Engineering of Complex Biological Networks 

      González, Patricia; Penas, David R.; Pardo, Xoán C.; Banga, Julio R.; Doallo, Ramón (Association for Computing Machinery, 2018-09)
      [Abstract] Optimization problems appear in different areas of science and engineering. This paper considers the general problem of reverse engineering in computational biology by means of mixed-integer nonlinear dynamic ...
    • Multimethod optimization in the cloud: A case‐study in systems biology modelling 

      González, Patricia; Penas, David R.; Pardo, Xoán C.; Banga, Julio R.; Doallo, Ramón (Wiley, 2018-06-25)
      [Abstract] Optimization problems appear in many different applications in science and engineering. A large number of different algorithms have been proposed for solving them; however, there is no unique general optimization ...
    • Multithreaded and Spark parallelization of feature selection filters 

      Eiras-Franco, Carlos; Bolón-Canedo, Verónica; Ramos Garea, Sabela; González-Domínguez, Jorge; Alonso-Betanzos, Amparo; Touriño, Juan (2016)
      [Abstract] Vast amounts of data are generated every day, constituting a volume that is challenging to analyze. Techniques such as feature selection are advisable when tackling large datasets. Among the tools that provide ...
    • Non-blocking Java Communications Support on Clusters 

      Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (Springer, 2006)
      [Abstract] This paper presents communication strategies for supporting efficient non-blocking Java communication on clusters. The communication performance is critical for the overall cluster performance. It is possible ...
    • Non-IID data and Continual Learning processes in Federated Learning: A long road ahead 

      Criado, Marcos F.; Casado, Fernando E.; Iglesias Rodríguez, Roberto; Regueiro, Carlos V.; Barro, Senén (Elsevier, 2022)
      [Abstract] Federated Learning is a novel framework that allows multiple devices or institutions to train a machine learning model collaboratively while keeping their data private. This decentralized approach is prone ...
    • Nonblocking collectives for scalable Java communications 

      Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan (John Wiley & Sons Ltd., 2015-04-22)
      [Abstract] This paper presents a Java implementation of the recently published MPI 3.0 nonblocking message passing collectives in order to analyze and assess the feasibility of taking advantage of these operations in shared ...
    • Novel parallelization of simulated annealing and Hooke & Jeeves search algorithms for multicore systems with application to complex fisheries stock assessment models 

      Vázquez Pardo, Sergio; Martín, María J.; Fraguela, Basilio B.; Gómez, Andrés; Rodríguez, Aurelio; Elvarsson, Bjarki Þór (Elsevier Ltd, 2016-11)
      [Abstract] Estimating parameters of a statistical fisheries assessment model typically involves a comparison of disparate datasets to a forward simulation model through a likelihood function. In all but trivial cases the ...
    • NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java 

      Mallón, Damián A.; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2009-05-08)
      [Abstract] Java is a valuable and emerging alternative for the development of parallel applications, thanks to the availability of several Java message-passing libraries and its full multithreading support. The combination ...
    • Numerical Simulation of Pollutant Transport in a Shallow-Water System on the Cell Heterogeneous Processor 

      González, Carlos H.; Fraguela, Basilio B.; Andrade, Diego; García Rodríguez, José Antonio; Castro, M.J. (Springer, 2013)
      [Abstract] This paper presents an implementation, optimized for the Cell processor, of a finite volume numerical scheme for 2D shallow-water systems with pollutant transport. A description of the special architecture and ...
    • On processing extreme data 

      Petcu, Dana; Iuhasz, Gabriel; Pop, Daniel; Talia, Domenico; Carretero, Jesús; Prodan, Radu; Fahringer, Thomas; Grasso, Ivan; Doallo, Ramón; Martín, María J.; Fraguela, Basilio B.; Trobec, Roman; Depolli, Matjaz; Almeida Rodriguez, Francisco; Sande, Francisco de; Da Costa, Georges; Pierson, Jean-Marc; Anastasiadis, Stergios; Bartzokas, Aristides; Lolis, Christos; Gonçalves, Pedro; Brito, Fabrice; Brown, Nick (Universitatea de Vest din Timisoara, West University of Timisoara, 2016)
      [Abstract] Extreme Data is an incarnation of Big Data concept distinguished by the massive amounts of data that must be queried, communicated and analyzed in near real-time by using a very large number of memory or storage ...
    • Ontological Configuration Management for Wireless Mesh Routers

      Díaz, Iván; Popi, Cristian; Festor, Olivier; Touriño, Juan; Doallo, Ramón (Springer, 2009)
      [Abstract] Wireless mesh networks (WMNs) are a category of wireless networks that are self-organized and robust, and that offer more flexible client coverage with fewer equipment requirements than wired networks. In WMNs, mesh ...
    • OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA 

      López Castro, Roberto; Andrade, Diego; Fraguela, Basilio B. (MDPI, 2021)
      [Abstract] Improving the performance of the convolution operation has become a key target for High Performance Computing (HPC) developers due to its prevalence in deep learning applied mainly to video processing. The ...
    • Optimization of Real-World MapReduce Applications With Flame-MR: Practical Use Cases 

      Veiga, Jorge; Expósito, Roberto R.; Raffin, Bruno; Touriño, Juan (Institute of Electrical and Electronics Engineers, 2018-11-12)
      [Abstract] Apache Hadoop is a widely used MapReduce framework for storing and processing large amounts of data. However, it presents some performance issues that hinder its utilization in many practical use cases. Although ...