Mostrar o rexistro simple do ítem

dc.contributor.authorEiras-Franco, Carlos
dc.contributor.authorBolón-Canedo, Verónica
dc.contributor.authorRamos Garea, Sabela
dc.contributor.authorGonzález-Domínguez, Jorge
dc.contributor.authorAlonso-Betanzos, Amparo
dc.contributor.authorTouriño, Juan
dc.date.accessioned2023-12-21T11:33:01Z
dc.date.available2023-12-21T11:33:01Z
dc.date.issued2016
dc.identifier.citationC. Eiras-Franco, V. Bolón-Canedo, S. Ramos, J. González-Domínguez, A. Alonso-Betanzos, and J. Touriño, "Multithreaded and Spark parallelization of feature selection filters", Journal of Computational Science, Vol. 17, Part 3, Nov. 2016, Pp. 609-619, https://doi.org/10.1016/j.jocs.2016.07.002es_ES
dc.identifier.urihttp://hdl.handle.net/2183/34589
dc.description©2016 Elsevier B.V. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/bync-nd/4.0/. This version of the article has been accepted for publication in Journal of Computational Science. The Version of Record is available online at https://doi.org/10.1016/j.jocs.2016.07.002es_ES
dc.descriptionVersión final aceptada de: C. Eiras-Franco, V. Bolón-Canedo, S. Ramos, J. González-Domínguez, A. Alonso-Betanzos, and J. Touriño, "Multithreaded and Spark parallelization of feature selection filters", Journal of Computational Science, Vol. 17, Part 3, Nov. 2016, Pp. 609-619es_ES
dc.description.abstract[Abstract]: Vast amounts of data are generated every day, constituting a volume that is challenging to analyze. Techniques such as feature selection are advisable when tackling large datasets. Among the tools that provide this functionality, Weka is one of the most popular ones, although the implementations it provides struggle when processing large datasets, requiring excessive times to be practical. Parallel processing can help alleviate this problem, effectively allowing users to work with Big Data. The computational power of multicore machines can be harnessed by using multithreading and distributed programming, effectively helping to tackle larger problems. Both these techniques can dramatically speed up the feature selection process allowing users to work with larger datasets. The reimplementation of four popular feature selection algorithms included in Weka is the focus of this work. Multithreaded implementations previously not included in Weka as well as parallel Spark implementations were developed for each algorithm. Experimental results obtained from tests on real-world datasets show that the new versions offer significant reductions in processing times.es_ES
dc.description.sponsorshipThis work has been financed in part by Xunta de Galicia under Research Network R2014/041 and project GRC2014/035, and by Spanish Ministerio de Economía y Competitividad under projects TIN2012-37954 and TIN-2015-65069-C2-1-R, partially funded by FEDER funds of the European Union. V. Bolón-Canedo acknowledges support of the Xunta de Galicia under postdoctoral Grant code ED481B 2014/164-0. Additionally, the collaboration of Jorge Veiga on setting up and using the MREv tool for Spark execution was essential for this work.es_ES
dc.description.sponsorshipXunta de Galicia; R2014/041es_ES
dc.description.sponsorshipXunta de Galicia; GRC2014/035es_ES
dc.description.sponsorshipXunta de Galicia; ED481B 2014/164-0es_ES
dc.language.isoenges_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2012-37954/ES/ALGORITMOS DE APRENDIZAJE COMPUTACIONAL EN ENTORNOS DISTRIBUIDOSes_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN-2015-65069-C2-1-R/ALGORITMOS ESCALABLES DE APRENDIZAJE COMPUTACIONAL: MAS ALLA DE LA CLASIFICACION Y LA REGRESIONes_ES
dc.relation.isversionofhttps://doi.org/10.1016/j.jocs.2016.07.002
dc.relation.urihttps://doi.org/10.1016/j.jocs.2016.07.002es_ES
dc.rightsAtribución-NoComercial-SinDerivadas 3.0 Españaes_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.subjectMultithreadinges_ES
dc.subjectSparkes_ES
dc.subjectFeature selectiones_ES
dc.subjectMachine learninges_ES
dc.titleMultithreaded and Spark parallelization of feature selection filterses_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleJournal of Computational Sciencees_ES
UDC.volume17es_ES
UDC.issue3es_ES
UDC.startPage609es_ES
UDC.endPage619es_ES
dc.identifier.doi10.1016/j.jocs.2016.07.002


Ficheiros no ítem

Thumbnail
Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem