A 2D algorithm with asymmetric workload for the UPC conjugate gradient method
dc.contributor.author | González-Domínguez, Jorge | |
dc.contributor.author | Marques, Osni A. | |
dc.contributor.author | Martín, María J. | |
dc.contributor.author | Touriño, Juan | |
dc.date.accessioned | 2018-08-14T12:12:40Z | |
dc.date.available | 2018-08-14T12:12:40Z | |
dc.date.issued | 2014 | |
dc.identifier.citation | González-Domínguez, J., Marques, O.A., Martín, M.J. et al. J Supercomput (2014) 70: 816. https://doi.org/10.1007/s11227-014-1300-0 | es_ES |
dc.identifier.issn | 0920-8542 | |
dc.identifier.issn | 1573-0484 | |
dc.identifier.uri | http://hdl.handle.net/2183/20967 | |
dc.description | This is a post-peer-review, pre-copyedit version of an article published in Journal of Supercomputing. The final authenticated version is available online at: https://doi.org/10.1007/s11227-014-1300-0 | es_ES |
dc.description.abstract | [Abstract] This paper examines four different strategies, each one with its own data distribution, for implementing the parallel conjugate gradient (CG) method and how they impact communication and overall performance. Firstly, typical 1D and 2D distributions of the matrix involved in CG computations are considered. Then, a new 2D version of the CG method with asymmetric workload, based on leaving some threads idle during part of the computation to reduce communication, is proposed. The four strategies are independent of sparse storage schemes and are implemented using Unified Parallel C (UPC), a Partitioned Global Address Space (PGAS) language. The strategies are evaluated on two different platforms through a set of matrices that exhibit distinct sparse patterns, demonstrating that our asymmetric proposal outperforms the others except for one matrix on one platform. | es_ES |
dc.description.sponsorship | Ministerio de Economía y Competitividad; TIN2013-42148-P | es_ES |
dc.description.sponsorship | Xunta de Galicia; GRC2013/055 | es_ES |
dc.description.sponsorship | United States. Department of Energy; DEAC03-76SF00098 | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Springer New York LLC | es_ES |
dc.relation.uri | https://doi.org/10.1007/s11227-014-1300-0 | es_ES |
dc.subject | Conjugate gradient | es_ES |
dc.subject | PGAS | es_ES |
dc.subject | UPC | es_ES |
dc.subject | Performance optimization | es_ES |
dc.subject | Data distribution | es_ES |
dc.title | A 2D algorithm with asymmetric workload for the UPC conjugate gradient method | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.access | info:eu-repo/semantics/openAccess | es_ES |
UDC.journalTitle | The Journal of Supercomputing | es_ES |
UDC.volume | 70 | es_ES |
UDC.issue | 2 | es_ES |
UDC.startPage | 816 | es_ES |
UDC.endPage | 829 | es_ES |
dc.identifier.doi | 10.1007/s11227-014-1300-0 | |
This item appears in the following collection(s)
- GI-GAC - Artigos [192]