Buscar
Mostrando ítems 1-10 de 36
Optimization of Real-World MapReduce Applications With Flame-MR: Practical Use Cases
(Institute of Electrical and Electronics Engineers, 2018-11-12)
[Abstract] Apache Hadoop is a widely used MapReduce framework for storing and processing large amounts of data. However, it presents some performance issues that hinder its utilization in many practical use cases. Although ...
Big Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualization
(Springer Netherlands, 2018-12)
[Abstract] With the increasing adoption of Big Data technologies as basic tools for the ongoing Digital Transformation, there is a high demand for data-intensive applications. In order to efficiently execute such applications, ...
BDEv 3.0: energy efficiency and microarchitectural characterization of Big Data processing frameworks
(Elsevier BV * North-Holland, 2018-09)
[Abstract] As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks becomes a crucial task in order to identify potential performance bottlenecks that may delay the processing of large ...
BDWatchdog: real-time monitoring and profiling of Big Data applications and frameworks
(Elsevier BV * North-Holland, 2018-10)
[Abstract] Current Big Data applications are characterized by a heavy use of system resources (e.g., CPU, disk) generally distributed across a cluster. To effectively improve their performance there is a critical need for ...
SparkEC: speeding up alignment-based DNA error correction tools
(BioMed Central (Springer), 2022)
[Abstract]: In recent years, huge improvements have been made in the context of sequencing genomic data under what is called Next Generation Sequencing (NGS). However, the DNA reads generated by current NGS platforms are ...
RGen: Data Generator for Benchmarking Big Data Workloads
(MDPI, 2021)
[Abstract] This paper presents RGen, a parallel data generator for benchmarking Big Data workloads, which integrates existing features and new functionalities in a standalone tool. The main functionalities developed in ...
Performance Optimization of a Parallel Error Correction Tool
(MDPI, 2021)
[Abstract] Due to the continuous development in the field of Next Generation Sequencing (NGS) technologies that have allowed researchers to take advantage of greater genetic samples in less time, it is a matter of relevance ...
FastMPJ: a scalable and efficient Java message-passing library
(Springer New York LLC, 2014)
[Abstract] The performance and scalability of communications are key for high performance computing (HPC) applications in the current multi-core era. Despite the significant benefits (e.g., productivity, portability, ...
Enhancing in-memory Efficiency for MapReduce-based Data Processing
(Academic Press, 2018-10)
[Abstract] As the memory capacity of computational systems increases, the in-memory data management of Big Data processing frameworks becomes more crucial for performance. This paper analyzes and improves the memory ...
Design of scalable Java message-passing communications over InfiniBand
(Springer New York LLC, 2012-07)
[Abstract] This paper presents ibvdev a scalable and efficient low-level Java message-passing communication device over InfiniBand. The continuous increase in the number of cores per processor underscores the need for ...