Dense Matrix Multiplication Algorithms and Performance Evaluation of HPCC in 81 Nodes IBM Power 8 Architecture

Ver/ abrir
Use este enlace para citar
http://hdl.handle.net/2183/28859
A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución 4.0 Internacional (CC BY 4.0)
Coleccións
- Investigación (EPEF) [587]
Metadatos
Mostrar o rexistro completo do ítemTítulo
Dense Matrix Multiplication Algorithms and Performance Evaluation of HPCC in 81 Nodes IBM Power 8 ArchitectureAutor(es)
Data
2021-08Cita bibliográfica
Estévez Ruiz, E.P.; Chicaiza, G.E.C.; Patiño, F.R.J.; López Lago, J.C.; Thirumuruganandham, S.P. Dense Matrix Multiplication Algorithms and Performance Evaluation of HPCC in 81 Nodes IBM Power 8 Architecture. Computation 2021, 9, 86. https://doi.org/10.3390/computation9080086
Resumo
[Abstract] Optimizing HPC systems based on performance factors and bottlenecks is essential for designing an HPC infrastructure with the best characteristics and at a reasonable cost. Such insight can only be achieved through a detailed analysis of existing HPC systems and the execution of their workloads. The “Quinde I” is the only and most powerful supercomputer in Ecuador and is currently listed third on the South America. It was built with the IBM Power 8 servers. In this work, we measured its performance using different parameters from High-Performance Computing (HPC) to compare it with theoretical values and values obtained from tests on similar models. To measure its performance, we compiled and ran different benchmarks with the specific optimization flags for Power 8 to get the maximum performance with the current configuration in the hardware installed by the vendor. The inputs of the benchmarks were varied to analyze their impact on the system performance. In addition, we compile and compare the performance of two algorithms for dense matrix multiplication SRUMMA and DGEMM.
Palabras chave
Supercomputer
Performance
Benchmark
IBM Power 8
HPC
Cluster
DGEMM
SRUMMA
Parallel Computing
Performance
Benchmark
IBM Power 8
HPC
Cluster
DGEMM
SRUMMA
Parallel Computing
Versión do editor
Dereitos
Atribución 4.0 Internacional (CC BY 4.0)
ISSN
2079-3197