    • Accelerating the quality control of genetic sequences through stream processing

      Castellanos Rodríguez, Óscar; Expósito, Roberto R.; Touriño, Juan (Association for Computing Machinery, 2023)
      [Abstract] Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are based on a batch processing model, needing to have ...
    • Enabling Hardware Affinity in JVM-Based Applications: A Case Study for Big Data 

      Expósito, Roberto R.; Veiga, Jorge; Touriño, Juan (Springer, 2020)
      [Abstract] Java has been the backbone of Big Data processing for more than a decade due to its interesting features such as object orientation, cross-platform portability and good programming productivity. In fact, most ...
    • Evaluation of Java for General Purpose GPU Computing 

      Docampo, Jorge; Ramos Garea, Sabela; López Taboada, Guillermo; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2013-07-01)
      [Abstract] The presence of many-core units as accelerators has been increasing due to their ability to improve the performance of highly parallel workloads. General Purpose GPU (GPGPU) computing has allowed the graphical ...
    • MREv: An Automatic MapReduce Evaluation Tool for Big Data Workloads 

      Veiga, Jorge; Expósito, Roberto R.; López Taboada, Guillermo; Touriño, Juan (Elsevier, 2015)
      [Abstract] The popularity of Big Data computing models like MapReduce has caused the emergence of many frameworks oriented to High Performance Computing (HPC) systems. The suitability of each one to a particular use case ...
    • Performance Evaluation of Big Data Frameworks for Large-Scale Data Analytics 

      Veiga, Jorge; Expósito, Roberto R.; Pardo, Xoán C.; López Taboada, Guillermo; Touriño, Juan (IEEE Computer Society, 2017-02-06)
      [Abstract] The increasing adoption of Big Data analytics has led to a high demand for efficient technologies in order to manage and process large datasets. Popular MapReduce frameworks such as Hadoop are being replaced by ...
    • Performance Optimization of a Parallel Error Correction Tool 

      Martínez-Sánchez, Marco; Expósito, Roberto R.; Touriño, Juan (MDPI, 2021)
      [Abstract] Due to the continuous development in the field of Next Generation Sequencing (NGS) technologies that have allowed researchers to take advantage of greater genetic samples in less time, it is a matter of relevance ...
    • Power Budgeting of Big Data Applications in Container-based Clusters 

      Enes, Jonatan; Fieni, Guillaume; Expósito, Roberto R.; Rouvoy, Romain; Touriño, Juan (Institute of Electrical and Electronics Engineers, 2020-11-02)
      [Abstract] Energy consumption is currently highly regarded on computing systems for many reasons, such as improving the environmental impact and reducing operational costs considering the rising price of energy. Previous ...
    • RGen: Data Generator for Benchmarking Big Data Workloads 

      Pérez-Jove, Rubén; Expósito, Roberto R.; Touriño, Juan (MDPI, 2021)
      [Abstract] This paper presents RGen, a parallel data generator for benchmarking Big Data workloads, which integrates existing features and new functionalities in a standalone tool. The main functionalities developed in ...