Showing items 11-14 of 14
Easy Dataflow Programming in Clusters with UPC++ DepSpawn
(Institute of Electrical and Electronics Engineers, 2019-06-01)
[Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...
A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)
(Institute of Electrical and Electronics Engineers, 2019)
[Abstract]: Many problems of industrial and scientific interest require the solving of tridiagonal linear systems. This paper presents several implementations for the parallel solving of large tridiagonal systems on ...
Numerical Simulation of Pollutant Transport in a Shallow-Water System on the Cell Heterogeneous Processor
(Springer, 2013)
[Abstract]: This paper presents an implementation, optimized for the Cell processor, of a finite volume numerical scheme for 2D shallow-water systems with pollutant transport. A description of the special architecture and ...
STuning-DL: Model-Driven Autotuning of Sparse GPU Kernels for Deep Learning
(Institute of Electrical and Electronics Engineers, 2024-05)
[Abstract]: The relentless growth of modern Machine Learning models has spurred the adoption of sparsification techniques to simplify their architectures and reduce the computational demands. Network pruning has demonstrated ...