
dc.contributor.author: Pérez-Jove, Rubén
dc.contributor.author: Expósito, Roberto R.
dc.contributor.author: Touriño, Juan
dc.date.accessioned: 2022-01-19T19:36:15Z
dc.date.available: 2022-01-19T19:36:15Z
dc.date.issued: 2021
dc.identifier.citation: Pérez-Jove, R.; Expósito, R.R.; Touriño, J. RGen: Data Generator for Benchmarking Big Data Workloads. Eng. Proc. 2021, 7, 13. https://doi.org/10.3390/engproc2021007013 [es_ES]
dc.identifier.uri: http://hdl.handle.net/2183/29447
dc.description: Presented at the 4th XoveTIC Conference, A Coruña, Spain, 7–8 October 2021. [es_ES]
dc.description.abstract: [Abstract] This paper presents RGen, a parallel data generator for benchmarking Big Data workloads, which integrates existing features and new functionalities in a standalone tool. The main functionalities developed in this work are the generation of text and graphs that meet the characteristics defined by the 4 Vs of Big Data. Text generation uses the LDA model, which extracts the topics covered in a set of documents. Graph generation is based on the Kronecker model. The experimental evaluation, carried out on a 16-node cluster, has shown that RGen provides very good weak and strong scalability results. RGen is publicly available to download at https://github.com/rubenperez98/RGen, accessed on 30 September 2021. [es_ES]
dc.description.sponsorship: CITIC, as a Research Center accredited by the Galician University System, is funded by the “Consellería de Cultura, Educación e Universidade” of the Xunta de Galicia, 80% through ERDF (ERDF Operational Programme Galicia 2014–2020) and the remaining 20% by the “Secretaría Xeral de Universidades” (Grant ED431G 2019/01). This project was also supported by the “Consellería de Cultura, Educación e Ordenación Universitaria” via the Consolidation and Structuring of Competitive Research Units: Competitive Reference Groups (ED431C 2018/49 and 2021/30). [es_ES]
dc.description.sponsorship: Xunta de Galicia; ED431G 2019/01 [es_ES]
dc.description.sponsorship: Xunta de Galicia; ED431C 2018/49 [es_ES]
dc.description.sponsorship: Xunta de Galicia; ED431C 2021/30 [es_ES]
dc.language.iso: eng [es_ES]
dc.publisher: MDPI [es_ES]
dc.relation.uri: https://doi.org/10.3390/engproc2021007013 [es_ES]
dc.rights: Attribution 3.0 Spain [es_ES]
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/es/ [*]
dc.subject: Data generator [es_ES]
dc.subject: MapReduce [es_ES]
dc.subject: HDFS [es_ES]
dc.subject: Apache Hadoop [es_ES]
dc.subject: Java [es_ES]
dc.subject: Big Data [es_ES]
dc.subject: Benchmarking [es_ES]
dc.title: RGen: Data Generator for Benchmarking Big Data Workloads [es_ES]
dc.type: info:eu-repo/semantics/conferenceObject [es_ES]
dc.rights.access: info:eu-repo/semantics/openAccess [es_ES]
UDC.journalTitle: Engineering Proceedings [es_ES]
UDC.volume: 7 [es_ES]
UDC.issue: 1 [es_ES]
UDC.startPage: 13 [es_ES]
dc.identifier.doi: 10.3390/engproc2021007013

