Analysis and evaluation of MapReduce solutions on an HPC cluster

Ver/Abrir
Use este enlace para citar
http://hdl.handle.net/2183/21697
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial-SinDerivadas 3.0 España
Colecciones
- Investigación (FIC) [1685]
Metadatos
Mostrar el registro completo del ítemTítulo
Analysis and evaluation of MapReduce solutions on an HPC clusterFecha
2016-02Cita bibliográfica
Jorge Veiga, Roberto R. Expósito, Guillermo L. Taboada, Juan Touriño, Analysis and evaluation of MapReduce solutions on an HPC cluster, Computers & Electrical Engineering, Volume 50, 2016, Pages 200-216, ISSN 0045-7906, https://doi.org/10.1016/j.compeleceng.2015.11.021. (http://www.sciencedirect.com/science/article/pii/S0045790615004127)
Resumen
[Abstract] The ever growing needs of Big Data applications are demanding challenging capabilities which cannot be handled easily by traditional systems, and thus more and more organizations are adopting High Performance Computing (HPC) to improve scalability and efficiency. Moreover, Big Data frameworks like Hadoop need to be adapted to leverage the available resources in HPC environments. This situation has caused the emergence of several HPC-oriented MapReduce frameworks, which benefit from different technologies traditionally oriented to supercomputing, such as high-performance interconnects or the message-passing interface. This work aims to establish a taxonomy of these frameworks together with a thorough evaluation, which has been carried out in terms of performance and energy efficiency metrics. Furthermore, the adaptability to emerging disks technologies, such as solid state drives, has been assessed. The results have shown that new frameworks like DataMPI can outperform Hadoop, although using IP over InfiniBand also provides significant benefits without code modifications.
Palabras clave
MapReduce
High performance computing (HPC)
Big Data
Energy efficiency
InfiniBand
Solid State Drive (SSD)
High performance computing (HPC)
Big Data
Energy efficiency
InfiniBand
Solid State Drive (SSD)
Descripción
This is a post-peer-review, pre-copyedit version of an article published in Computers & Electrical Engineering. The final authenticated version is available online at: https://doi.org/10.1016/j.compeleceng.2015.11.021
Versión del editor
Derechos
Atribución-NoComercial-SinDerivadas 3.0 España
ISSN
0045-7906
1879-0755
1879-0755