Communication avoiding and overlapping for numerical linear algebra

Georganas, Evangelos; González-Domínguez, Jorge; Solomonik, Edgar; Zheng, Yili; Touriño, Juan; Yelick, Katherine

dc.contributor.author	Georganas, Evangelos
dc.contributor.author	González-Domínguez, Jorge
dc.contributor.author	Solomonik, Edgar
dc.contributor.author	Zheng, Yili
dc.contributor.author	Touriño, Juan
dc.contributor.author	Yelick, Katherine
dc.date.accessioned	2019-07-03T17:30:20Z
dc.date.available	2019-07-03T17:30:20Z
dc.date.issued	2013-02-25
dc.identifier.citation	E. Georganas, J. Gonzalez-Dominguez, E. Solomonik, Y. Zheng, J. Tourino and K. Yelick, "Communication avoiding and overlapping for numerical linear algebra," SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Salt Lake City, UT, 2012, pp. 1-11.	es_ES
dc.identifier.other	INSPEC Accession Number: 13372346
dc.identifier.uri	http://hdl.handle.net/2183/23395
dc.description	This is a post-peer-review, pre-copyedit version. The final authenticated version is available online at: http://dx.doi.org/10.1109/SC.2012.32	es_ES
dc.description.abstract	[Abstract] To efficiently scale dense linear algebra problems to future exascale systems, communication cost must be avoided or overlapped. Communication-avoiding 2.5D algorithms improve scalability by reducing inter-processor data transfer volume at the cost of extra memory usage. Communication overlap attempts to hide messaging latency by pipelining messages and overlapping with computational work. We study the interaction and compatibility of these two techniques for two matrix multiplication algorithms (Cannon and SUMMA), triangular solve, and Cholesky factorization. For each algorithm, we construct a detailed performance model that considers both critical path dependencies and idle time. We give novel implementations of 2.5D algorithms with overlap for each of these problems. Our software employs UPC, a partitioned global address space (PGAS) language that provides fast one-sided communication. We show communication avoidance and overlap provide a cumulative benefit as core counts scale, including results using over 24K cores of a Cray XE6 system.	es_ES
dc.description.sponsorship	Office of Science of the U.S. Department of Energy; DE-AC02-05CH11231	es_ES
dc.description.sponsorship	Office of Science of the U.S. Department of Energy; DARPA HR0011-10-9-0008	es_ES
dc.description.sponsorship	Ministerio de Ciencia e Innovación; TIN2010-16735	es_ES
dc.description.sponsorship	Ministerio de Educación; AP2008-01578	es_ES
dc.description.sponsorship	Krell Department of Energy Computational Science Graduate Fellowship; DE-FG02-97ER25308	es_ES
dc.language.iso	eng	es_ES
dc.publisher	IEEE Computer Society	es_ES
dc.relation.uri	http://dx.doi.org/10.1109/SC.2012.32	es_ES
dc.subject	Program processors	es_ES
dc.subject	Bandwidth	es_ES
dc.subject	Partitioning algorithms	es_ES
dc.subject	Linear algebra	es_ES
dc.subject	Message systems	es_ES
dc.subject	Hardware	es_ES
dc.subject	Layout	es_ES
dc.title	Communication avoiding and overlapping for numerical linear algebra	es_ES
dc.type	info:eu-repo/semantics/conferenceObject	es_ES
dc.rights.access	info:eu-repo/semantics/openAccess	es_ES
UDC.startPage	1	es_ES
UDC.endPage	11	es_ES
dc.identifier.doi	10.1109/SC.2012.32
UDC.conferenceTitle	SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis	es_ES

Files in this item

Name: E.Georganas_2012_Communication ...
Size: 511.1Kb
Format: PDF

This item appears in the following collection(s)

GI-GAC - Congresos, conferencias, etc. [55]
