Analysis and evaluation of MapReduce solutions on an HPC cluster

View/ Open
Use this link to cite
http://hdl.handle.net/2183/21697
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 3.0 España
Collections
- Investigación (FIC) [1685]
Metadata
Show full item recordTitle
Analysis and evaluation of MapReduce solutions on an HPC clusterDate
2016-02Citation
Jorge Veiga, Roberto R. Expósito, Guillermo L. Taboada, Juan Touriño, Analysis and evaluation of MapReduce solutions on an HPC cluster, Computers & Electrical Engineering, Volume 50, 2016, Pages 200-216, ISSN 0045-7906, https://doi.org/10.1016/j.compeleceng.2015.11.021. (http://www.sciencedirect.com/science/article/pii/S0045790615004127)
Abstract
[Abstract] The ever growing needs of Big Data applications are demanding challenging capabilities which cannot be handled easily by traditional systems, and thus more and more organizations are adopting High Performance Computing (HPC) to improve scalability and efficiency. Moreover, Big Data frameworks like Hadoop need to be adapted to leverage the available resources in HPC environments. This situation has caused the emergence of several HPC-oriented MapReduce frameworks, which benefit from different technologies traditionally oriented to supercomputing, such as high-performance interconnects or the message-passing interface. This work aims to establish a taxonomy of these frameworks together with a thorough evaluation, which has been carried out in terms of performance and energy efficiency metrics. Furthermore, the adaptability to emerging disks technologies, such as solid state drives, has been assessed. The results have shown that new frameworks like DataMPI can outperform Hadoop, although using IP over InfiniBand also provides significant benefits without code modifications.
Keywords
MapReduce
High performance computing (HPC)
Big Data
Energy efficiency
InfiniBand
Solid State Drive (SSD)
High performance computing (HPC)
Big Data
Energy efficiency
InfiniBand
Solid State Drive (SSD)
Description
This is a post-peer-review, pre-copyedit version of an article published in Computers & Electrical Engineering. The final authenticated version is available online at: https://doi.org/10.1016/j.compeleceng.2015.11.021
Editor version
Rights
Atribución-NoComercial-SinDerivadas 3.0 España
ISSN
0045-7906
1879-0755
1879-0755