Skip navigation
  •  Home
  • UDC 
    • Getting started
    • RUC Policies
    • FAQ
    • FAQ on Copyright
    • More information at INFOguias UDC
  • Browse 
    • Communities
    • Browse by:
    • Issue Date
    • Author
    • Title
    • Subject
  • Help
    • español
    • Gallegan
    • English
  • Login
  •  English 
    • Español
    • Galego
    • English
  
View Item 
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud

Thumbnail
View/Open
Expósito_R.R._Performance Evaluation of Data-Intensive Computing Applications_2016.pdf (990.9Kb)
Use this link to cite
http://hdl.handle.net/2183/20849
Collections
  • Investigación (FIC) [1678]
Metadata
Show full item record
Title
Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud
Author(s)
Expósito, Roberto R.
Taboada, Guillermo L.
Ramos Garea, Sabela
Touriño, Juan
Doallo, Ramón
Date
2016
Citation
Expósito, R. R., Taboada, G. L., Ramos, S., Touriño, J., & Doallo, R. (2016). Performance evaluation of data-intensive computing applications on a public IaaS cloud. The Computer Journal, 59(3), 287-307.
Abstract
[Abstract] The advent of cloud computing technologies, which dynamically provide on-demand access to computational resources over the Internet, is offering new possibilities to many scientists and researchers. Nowadays, Infrastructure as a Service (IaaS) cloud providers can offset the increasing processing requirements of data-intensive computing applications, becoming an emerging alternative to traditional servers and clusters. In this paper, a comprehensive study of the leading public IaaS cloud platform, Amazon EC2, has been conducted in order to assess its suitability for data-intensive computing. One of the key contributions of this work is the analysis of the storage-optimized family of EC2 instances. Furthermore, this study presents a detailed analysis of both performance and cost metrics. More specifically, multiple experiments have been carried out to analyze the full I/O software stack, ranging from the low-level storage devices and cluster file systems up to real-world applications using representative data-intensive parallel codes and MapReduce-based workloads. The analysis of the experimental results has shown that data-intensive applications can benefit from tailored EC2-based virtual clusters, enabling users to obtain the highest performance and cost-effectiveness in the cloud.
Keywords
Data intensive computing
Cloud computing
Infrastructure as a service
Amazon EC2
Cluster file system
MapReduce
 
Description
This is a post-peer-review, pre-copyedit version of an article published in The Computer Journal. The final authenticated version is available online at: https://doi.org/10.1093/comjnl/bxu111
Editor version
https://doi.org/10.1093/comjnl/bxu111
ISSN
0010-4620
1460-2067
 

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic DegreeThis CollectionBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic Degree

My Account

LoginRegister

Statistics

View Usage Statistics
Sherpa
OpenArchives
OAIster
Scholar Google
UNIVERSIDADE DA CORUÑA. Servizo de Biblioteca.    DSpace Software Copyright © 2002-2013 Duraspace - Send Feedback