Performance Evaluation of Big Data Frameworks for Large-Scale Data Analytics

UDC.coleccionInvestigaciónes_ES
UDC.conferenceTitle2016 IEEE International Conference on Big Data (Big Data)es_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage431es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.startPage424es_ES
dc.contributor.authorVeiga, Jorge
dc.contributor.authorExpósito, Roberto R.
dc.contributor.authorPardo, Xoán C.
dc.contributor.authorTaboada, Guillermo L.
dc.contributor.authorTouriño, Juan
dc.date.accessioned2019-07-02T14:28:27Z
dc.date.available2019-07-02T14:28:27Z
dc.date.issued2017-02-06
dc.descriptionThis is a post-peer-review, pre-copyedit version of an article published. The final authenticated version is available online at: http://dx.doi.org/10.1109/BigData.2016.7840633es_ES
dc.description.abstract[Abstract] The increasing adoption of Big Data analytics has led to a high demand for efficient technologies in order to manage and process large datasets. Popular MapReduce frameworks such as Hadoop are being replaced by emerging ones like Spark or Flink, which improve both the programming APIs and performance. However, few works have focused on comparing these frameworks. This paper addresses this issue by performing a comparative evaluation of Hadoop, Spark and Flink using representative Big Data workloads and considering factors like performance and scalability. Moreover, the behavior of these frameworks has been characterized by modifying some of the main parameters of the workloads such as HDFS block size, input data size, interconnect network or thread configuration. The analysis of the results has shown that replacing Hadoop with Spark or Flink can lead to a reduction in execution times by 77% and 70% on average, respectively, for non-sort benchmarks.es_ES
dc.description.sponsorshipMinisterio de Ecnomía y Competitividad; TIN2013-42148-Pes_ES
dc.description.sponsorshipMinisterio de Educación; FPU14/02805es_ES
dc.identifier.citationJ. Veiga, R. R. Expósito, X. C. Pardo, G. L. Taboada and J. Tourifio, "Performance evaluation of big data frameworks for large-scale data analytics," 2016 IEEE International Conference on Big Data (Big Data), Washington, DC, 2016, pp. 424-431.es_ES
dc.identifier.doi10.1109/BigData.2016.7840633
dc.identifier.urihttp://hdl.handle.net/2183/23359
dc.language.isoenges_ES
dc.publisherIEEE Computer Societyes_ES
dc.relation.urihttp://dx.doi.org/10.1109/BigData.2016.7840633es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectSparkses_ES
dc.subjectBenchmark testinges_ES
dc.subjectBig Dataes_ES
dc.subjectGeneratorses_ES
dc.subjectProgramminges_ES
dc.subjectClustering algorithmses_ES
dc.subjectComputational modelinges_ES
dc.titlePerformance Evaluation of Big Data Frameworks for Large-Scale Data Analyticses_ES
dc.typeconference outputes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication0ef9135c-b7c9-48f1-8f06-55c025236916
relation.isAuthorOfPublication6a6967e9-a4f5-4006-afee-4fc9d5f3a658
relation.isAuthorOfPublication39e887b1-611f-4ca0-9fc3-32245bf93f9f
relation.isAuthorOfPublication86e306a5-99a1-4c43-8faa-720f0a9f0a34
relation.isAuthorOfPublication.latestForDiscovery0ef9135c-b7c9-48f1-8f06-55c025236916

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
J.Veiga_2016_Performance_Evaluation_of_Big_Data_Frameworks_for_Large-scale_Data_Analytics.pdf
Size:
289.91 KB
Format:
Adobe Portable Document Format
Description: