A Highly Optimized Skeleton for Unbalanced and Deep Divide-And-Conquer Algorithms on Multi-Core Clusters
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Enxeñaría de Computadores | es_ES |
| UDC.endPage | 10454 | es_ES |
| UDC.grupoInv | Grupo de Arquitectura de Computadores (GAC) | es_ES |
| UDC.journalTitle | The Journal of Supercomputing | es_ES |
| UDC.startPage | 10434 | es_ES |
| UDC.volume | 78 | es_ES |
| dc.contributor.author | Álvarez Martínez, Millán | |
| dc.contributor.author | Fraguela, Basilio B. | |
| dc.contributor.author | Cabaleiro, José Carlos | |
| dc.date.accessioned | 2022-06-28T14:53:05Z | |
| dc.date.available | 2022-06-28T14:53:05Z | |
| dc.date.issued | 2022 | |
| dc.description | Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG | es_ES |
| dc.description.abstract | [Abstract] Efficiently implementing the divide-and-conquer pattern of parallelism in distributed memory systems is very relevant, given its ubiquity, and difficult, given its recursive nature and the need to exchange tasks and data among the processors. This task is noticeably further complicated in the presence of multi-core systems, where hybrid parallelism must be exploited to attain the best performance, and when unbalanced and deep workloads are considered, as additional measures must be taken to load balance and avoid deep recursion problems. In this manuscript a parallel skeleton that fulfills all these requirements while providing high levels of usability is presented. In fact, the evaluation shows that our proposal is on average 415.32% faster than MPI codes and 229.18% faster than MPI + OpenMP benchmarks, while offering an average improvement in the programmability metrics of 131.04% over MPI alternatives and 155.18% over MPI + OpenMP solutions. | es_ES |
| dc.description.sponsorship | This research was supported by the Ministry of Science and Innovation of Spain (PID2019-104184RB-I00 and PID2019-104834GB-I00, AEI/FEDER/EU, 10.13039/501100011033) and the predoctoral Grant of Millán Álvarez Ref. BES-2017-081320), and by the Xunta de Galicia co-founded by the European Regional Development Fund (ERDF) under the Consolidation Programme of Competitive Reference Groups (ED431C 2018/19 and ED431C 2021/30). We acknowledge also the support from the Centro Singular de Investigación de Galicia “CITIC” and the Centro Singular de Investigación en Tecnoloxías Intelixentes “CiTIUS”, funded by Xunta de Galicia and the European Union (European Regional Development Fund- Galicia 2014-2020 Program), by Grants ED431G 2019/01 and ED431G 2019/04. We also acknowledge the Centro de Supercomputación de Galicia (CESGA). Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2018/19 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2021/30 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2019/01 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2019/04 | es_ES |
| dc.identifier.citation | Martínez, M.A., Fraguela, B.B. & Cabaleiro, J.C. A highly optimized skeleton for unbalanced and deep divide-and-conquer algorithms on multi-core clusters. J Supercomput 78, 10434–10454 (2022). https://doi.org/10.1007/s11227-021-04259-5 | es_ES |
| dc.identifier.doi | 10.1007/s11227-021-04259-5 | |
| dc.identifier.uri | http://hdl.handle.net/2183/31019 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Springer | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-104184RB-I00/ES/DESAFIOS ACTUALES EN HPC: ARQUITECTURAS, SOFTWARE Y APLICACIONES/ | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-104834GB-I00/ES/COMPUTACION DE ALTAS PRESTACIONES Y CLOUD PARA APLICACIONES DE ALTO INTERES/ | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/BES-2017-081320/ES/ | |
| dc.relation.uri | https://doi.org/10.1007/s11227-021-04259-5 | es_ES |
| dc.rights | Atribución 4.0 Internacional | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
| dc.subject | Algorithmic skeletons | es_ES |
| dc.subject | Divide-and-conquer | es_ES |
| dc.subject | Template metaprogramming | es_ES |
| dc.subject | Load balancing | es_ES |
| dc.subject | Multi-core clusters | es_ES |
| dc.subject | Hybrid parallelism | es_ES |
| dc.title | A Highly Optimized Skeleton for Unbalanced and Deep Divide-And-Conquer Algorithms on Multi-Core Clusters | es_ES |
| dc.type | journal article | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 7f5bae1c-08f6-4204-b22a-fbe20407a6e4 | |
| relation.isAuthorOfPublication.latestForDiscovery | 7f5bae1c-08f6-4204-b22a-fbe20407a6e4 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Martinez_Millan_A_2022_Highly_Optimized_Skeleton.pdf
- Size:
- 1.15 MB
- Format:
- Adobe Portable Document Format
- Description:

