Mostrar o rexistro simple do ítem

dc.contributor.authorTeijeiro Barjas, Carlos
dc.contributor.authorTaboada, Guillermo L.
dc.contributor.authorTouriño, Juan
dc.contributor.authorDoallo, Ramón
dc.contributor.authorMouriño, José C.
dc.contributor.authorMallón, Damián A.
dc.contributor.authorWibecan, Brian
dc.date.accessioned2018-11-07T15:45:18Z
dc.date.available2018-11-07T15:45:18Z
dc.date.issued2013
dc.identifier.citationTeijeiro, C., Taboada, G.L., Touriño, J. et al. J. Comput. Sci. Technol. (2013) 28: 72. https://doi.org/10.1007/s11390-013-1313-9es_ES
dc.identifier.issn1000-9000
dc.identifier.issn1860-4749
dc.identifier.urihttp://hdl.handle.net/2183/21247
dc.descriptionThis is a post-peer-review, pre-copyedit version of an article published in Journal of Computer Science and Technology. The final authenticated version is available online at: https://doi.org/10.1007/s11390-013-1313-9es_ES
dc.description.abstract[Abstract] Unified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while it can take advantage of the scalability of distributed memory architectures. Therefore, UPC allows programmers to write parallel applications on hybrid shared/distributed memory architectures, such as multi-core clusters, in a more productive way, accessing remote memory by means of different high-level language constructs, such as assignments to shared variables or collective primitives. However, the standard UPC collectives library includes a reduced set of eight basic primitives with quite limited functionality. This work presents the design and implementation of extended UPC collective functions that overcome the limitations of the standard collectives library, allowing, for example, the use of a specific source and destination thread or defining the amount of data transferred by each particular thread. This library fulfills the demands made by the UPC developers community and implements portable algorithms, independent of the specific UPC compiler/runtime being used. The use of a representative set of these extended collectives has been evaluated using two applications and four kernels as case studies. The results obtained confirm the suitability of the new library to provide easier programming without trading off performance, thus achieving high productivity in parallel programming to harness the performance of hybrid shared/distributed memory architectures in high performance computing.es_ES
dc.description.sponsorshipMinisterio de Ciencia e Innovación; TIN2010-16735es_ES
dc.description.sponsorshipXunta de Galcia; 2010/6es_ES
dc.language.isoenges_ES
dc.publisherSpringer New York LLCes_ES
dc.relation.urihttps://doi.org/10.1007/s11390-013-1313-9es_ES
dc.subjectUnified Parallel Ces_ES
dc.subjectCollective operationes_ES
dc.subjectProgrammabilityes_ES
dc.subjectPartitioned global address spacees_ES
dc.subjectHigh performance computinges_ES
dc.titleDesign and Implementation of an extended collectives library for unified Parallel Ces_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleJournal of Computer Science and Technologyes_ES
UDC.volume28es_ES
UDC.issue1es_ES
UDC.startPage78es_ES
UDC.endPage89es_ES
dc.identifier.doi10.1007/s11390-013-1313-9


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem