Buscar
Mostrando ítems 1-10 de 25
Optimization of Real-World MapReduce Applications With Flame-MR: Practical Use Cases
(Institute of Electrical and Electronics Engineers, 2018-11-12)
[Abstract] Apache Hadoop is a widely used MapReduce framework for storing and processing large amounts of data. However, it presents some performance issues that hinder its utilization in many practical use cases. Although ...
Big Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualization
(Springer Netherlands, 2018-12)
[Abstract] With the increasing adoption of Big Data technologies as basic tools for the ongoing Digital Transformation, there is a high demand for data-intensive applications. In order to efficiently execute such applications, ...
BDEv 3.0: energy efficiency and microarchitectural characterization of Big Data processing frameworks
(Elsevier BV * North-Holland, 2018-09)
[Abstract] As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks becomes a crucial task in order to identify potential performance bottlenecks that may delay the processing of large ...
BDWatchdog: real-time monitoring and profiling of Big Data applications and frameworks
(Elsevier BV * North-Holland, 2018-10)
[Abstract] Current Big Data applications are characterized by a heavy use of system resources (e.g., CPU, disk) generally distributed across a cluster. To effectively improve their performance there is a critical need for ...
MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud
(Oxford University Press, 2017)
[Abstract] This article presents MarDRe, a de novo cloud-ready duplicate and near-duplicate removal tool that can process single- and paired-end reads from FASTQ/FASTA datasets. MarDRe takes advantage of the widely adopted ...
HSRA: Hadoop-based spliced read aligner for RNA sequencing data
(Public Library of Science, 2018-07-31)
[Abstract] Nowadays, the analysis of transcriptome sequencing (RNA-seq) data has become the standard method for quantifying the levels of gene expression. In RNA-seq experiments, the mapping of short reads to a reference ...
Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform
(Springer Netherlands, 2013-12)
[Abstract] Cloud computing is currently being explored by the scientific community to assess its suitability for High Performance Computing (HPC) environments. In this novel paradigm, compute and storage resources, as well ...
Analysis and evaluation of MapReduce solutions on an HPC cluster
(Pergamon Press, 2016-02)
[Abstract] The ever growing needs of Big Data applications are demanding challenging capabilities which cannot be handled easily by traditional systems, and thus more and more organizations are adopting High Performance ...
Low‐latency Java communication devices on RDMA‐enabled networks
(John Wiley & Sons Ltd., 2015)
[Abstract] Providing high‐performance inter‐node communication is a key capability for running high performance computing applications efficiently on parallel architectures. In fact, current systems deployments are aggregating ...
Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud
(Oxford University Press, 2016)
[Abstract] The advent of cloud computing technologies, which dynamically provide on-demand access to computational resources over the Internet, is offering new possibilities to many scientists and researchers. Nowadays, ...