Big Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualization

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage605es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.issue4es_ES
UDC.journalTitleJournal of Grid Computinges_ES
UDC.startPage587es_ES
UDC.volume16es_ES
dc.contributor.authorLópez Cacheiro, Javier
dc.contributor.authorExpósito, Roberto R.
dc.contributor.authorTouriño, Juan
dc.contributor.authorEnes, Jonatan
dc.date.accessioned2019-02-14T16:33:21Z
dc.date.embargoEndDate2020-01-01es_ES
dc.date.embargoLift2020-01-01
dc.date.issued2018-12
dc.descriptionThis is a post-peer-review, pre-copyedit version of an article published in Journal of Grid Computing. The final authenticated version is available online at: https://doi.org/10.1007/s10723-018-9460-4es_ES
dc.description.abstract[Abstract] With the increasing adoption of Big Data technologies as basic tools for the ongoing Digital Transformation, there is a high demand for data-intensive applications. In order to efficiently execute such applications, it is vital that cloud providers change the way hardware infrastructure resources are managed to improve their performance. However, the increasing use of virtualization technologies to achieve an efficient usage of infrastructure resources continuously widens the gap between applications and the underlying hardware, thus decreasing resource efficiency for the end user. Moreover, this scenario is especially troublesome for Big Data applications, as storage resources are one of the most heavily virtualized, thus imposing a significant overhead for large-scale data processing. This paper proposes a novel PaaS architecture specifically oriented for Big Data where the scheduler offers disks as resources alongside the more common CPU and memory resources, looking forward to provide a better storage solution for the user. Furthermore, virtualization overheads are reduced to the bare minimum by replacing heavy hypervisor-based technologies with operating-system-level virtualization based on light software containers. This architecture has been deployed on a Big Data infrastructure at the CESGA supercomputing center, used as a testbed to compare its performance with OpenStack, a popular private cloud platform. Results have shown significant performance improvements, reducing the execution time of representative Big Data workloads by up to 4.5×.es_ES
dc.description.sponsorshipMinisterio de Economía, Industria y Competitividad; TIN2016-75845-P, AEI/FEDER, EUes_ES
dc.description.sponsorshipMinisterio de Educación; FPU15/03381es_ES
dc.identifier.citationEnes, J., Cacheiro, J.L., Expósito, R.R. et al. J Grid Computing (2018) 16: 587. https://doi.org/10.1007/s10723-018-9460-4es_ES
dc.identifier.doi10.1007/s10723-018-9460-4
dc.identifier.issn1570-7873
dc.identifier.issn1572-9184
dc.identifier.urihttp://hdl.handle.net/2183/21763
dc.language.isoenges_ES
dc.publisherSpringer Netherlandses_ES
dc.relation.urihttps://doi.org/10.1007/s10723-018-9460-4es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectBig Dataes_ES
dc.subjectPlatform as a Service (PaaS)es_ES
dc.subjectCloud computinges_ES
dc.subjectDisk-as-a-resource schedulinges_ES
dc.subjectOperating-system-level virtualizationes_ES
dc.titleBig Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualizationes_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication6a6967e9-a4f5-4006-afee-4fc9d5f3a658
relation.isAuthorOfPublication86e306a5-99a1-4c43-8faa-720f0a9f0a34
relation.isAuthorOfPublicationd68e4e4d-e41b-45fb-8b6d-80dc86429b02
relation.isAuthorOfPublication.latestForDiscovery6a6967e9-a4f5-4006-afee-4fc9d5f3a658

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
J.Enes_2018_Big_Data-Oriented _aaS_Architecture.pdf
Size:
2.51 MB
Format:
Adobe Portable Document Format
Description: