Mostrar o rexistro simple do ítem

dc.contributor.authorVeiga, Jorge
dc.contributor.authorExpósito, Roberto R.
dc.contributor.authorPardo, Xoán C.
dc.contributor.authorTaboada, Guillermo L.
dc.contributor.authorTouriño, Juan
dc.date.accessioned2019-07-02T14:28:27Z
dc.date.available2019-07-02T14:28:27Z
dc.date.issued2017-02-06
dc.identifier.citationJ. Veiga, R. R. Expósito, X. C. Pardo, G. L. Taboada and J. Tourifio, "Performance evaluation of big data frameworks for large-scale data analytics," 2016 IEEE International Conference on Big Data (Big Data), Washington, DC, 2016, pp. 424-431.es_ES
dc.identifier.urihttp://hdl.handle.net/2183/23359
dc.descriptionThis is a post-peer-review, pre-copyedit version of an article published. The final authenticated version is available online at: http://dx.doi.org/10.1109/BigData.2016.7840633es_ES
dc.description.abstract[Abstract] The increasing adoption of Big Data analytics has led to a high demand for efficient technologies in order to manage and process large datasets. Popular MapReduce frameworks such as Hadoop are being replaced by emerging ones like Spark or Flink, which improve both the programming APIs and performance. However, few works have focused on comparing these frameworks. This paper addresses this issue by performing a comparative evaluation of Hadoop, Spark and Flink using representative Big Data workloads and considering factors like performance and scalability. Moreover, the behavior of these frameworks has been characterized by modifying some of the main parameters of the workloads such as HDFS block size, input data size, interconnect network or thread configuration. The analysis of the results has shown that replacing Hadoop with Spark or Flink can lead to a reduction in execution times by 77% and 70% on average, respectively, for non-sort benchmarks.es_ES
dc.description.sponsorshipMinisterio de Ecnomía y Competitividad; TIN2013-42148-Pes_ES
dc.description.sponsorshipMinisterio de Educación; FPU14/02805es_ES
dc.language.isoenges_ES
dc.publisherIEEE Computer Societyes_ES
dc.relation.urihttp://dx.doi.org/10.1109/BigData.2016.7840633es_ES
dc.subjectSparkses_ES
dc.subjectBenchmark testinges_ES
dc.subjectBig Dataes_ES
dc.subjectGeneratorses_ES
dc.subjectProgramminges_ES
dc.subjectClustering algorithmses_ES
dc.subjectComputational modelinges_ES
dc.titlePerformance Evaluation of Big Data Frameworks for Large-Scale Data Analyticses_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.startPage424es_ES
UDC.endPage431es_ES
dc.identifier.doi10.1109/BigData.2016.7840633
UDC.conferenceTitle2016 IEEE International Conference on Big Data (Big Data)es_ES


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem