Flame-MR: An event-driven architecture for MapReduce applications

View/ Open
Use this link to cite
http://hdl.handle.net/2183/20884
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 3.0 España
Collections
- Investigación (FIC) [1656]
Metadata
Show full item recordTitle
Flame-MR: An event-driven architecture for MapReduce applicationsDate
2016Citation
Veiga, J., Expósito, R. R., Taboada, G. L., & Tourino, J. (2016). Flame-MR: an event-driven architecture for MapReduce applications. Future Generation Computer Systems, 65, 46-56.
Abstract
[Abstract] Nowadays, many organizations analyze their data with the MapReduce paradigm, most of them using the popular Apache Hadoop framework. As the data size managed by MapReduce applications is steadily increasing, the need for improving the Hadoop performance also grows. Existing modifications of Hadoop (e.g., Mellanox Unstructured Data Accelerator) attempt to improve performance by changing some of its underlying subsystems. However, they are not always capable to cope with all its performance bottlenecks or they hinder its portability. Furthermore, new frameworks like Apache Spark or DataMPI can achieve good performance improvements, but they do not keep compatibility with existing MapReduce applications. This paper proposes Flame-MR, a new event-driven MapReduce architecture that increases Hadoop performance by avoiding memory copies and pipelining data movements, without modifying the source code of the applications. The performance evaluation on two representative systems (an HPC cluster and a public cloud platform) has shown experimental evidence of significant performance increases, reducing the execution time by up to 54% on the Amazon EC2 cloud.
Keywords
Big Data
MapReduce
Hadoop
Event-driven architecture
Cloud computing
MapReduce
Hadoop
Event-driven architecture
Cloud computing
Editor version
Rights
Atribución-NoComercial-SinDerivadas 3.0 España
ISSN
0167-739X
1872-7115
1872-7115