Performance Optimization of a Parallel Error Correction Tool
(MDPI, 2021)
[Abstract]: Due to the continuous development in the field of Next Generation Sequencing (NGS) technologies that have allowed researchers to take advantage of greater genetic samples in less time, it is a matter of relevance ...
MREv: An Automatic MapReduce Evaluation Tool for Big Data Workloads
(Elsevier, 2015)
[Abstract]: The popularity of Big Data computing models like MapReduce has caused the emergence of many frameworks oriented to High Performance Computing (HPC) systems. The suitability of each one to a particular use case ...
Enabling Hardware Affinity in JVM-Based Applications: A Case Study for Big Data
(Springer, 2020)
[Abstract]: Java has been the backbone of Big Data processing for more than a decade due to its interesting features such as object orientation, cross-platform portability and good programming productivity. In fact, most ...
Power Budgeting of Big Data Applications in Container-based Clusters
(Institute of Electrical and Electronics Engineers, 2020-11-02)
[Abstract]: Energy consumption is currently highly regarded on computing systems for many reasons, such as improving the environmental impact and reducing operational costs considering the rising price of energy. Previous ...
CUDA-JMI: Acceleration of feature selection on heterogeneous systems
(Elsevier, 2020-01)
[Abstract]: Feature selection is a crucial step nowadays in machine learning and data analytics to remove irrelevant and redundant characteristics and thus to provide fast and reliable analyses. Many research works have ...
Serverless-like platform for container-based YARN clusters
(Elsevier, 2024-06)
[Abstract]: Serverless computing is an emerging paradigm that has gained a lot of relevance in recent years, as it allows users to consume computing resources without worrying about the underlying infrastructure and pay ...
BigDEC: A multi-algorithm Big Data tool based on the k-mer spectrum method for scalable short-read error correction
(Elsevier, 2024-05)
[Abstract]: Despite the significant improvements in both throughput and cost provided by modern Next-Generation Sequencing (NGS) platforms, sequencing errors in NGS datasets can still degrade the quality of downstream ...
Running scientific codes on Amazon EC2: a performance analysis of five high-end instances
(Springer New York LLC, 2013)
[Abstract]: Amazon Web Services (AWS) is a well-known public Infrastructure-as-a-Service (IaaS) provider whose Elastic Computing Cloud (EC2) offering includes some instances, known as cluster instances, aimed at High-Performance ...
Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform
(Springer Netherlands, 2013-12)
[Abstract]: Cloud computing is currently being explored by the scientific community to assess its suitability for High Performance Computing (HPC) environments. In this novel paradigm, compute and storage resources, as well ...
Accelerating the quality control of genetic sequences through stream processing
(Association for Computing Machinery, 2023)
[Abstract]: Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are based on a batch processing model, needing to have ...