Buscar
Mostrando ítems 21-29 de 29
Easy Dataflow Programming in Clusters with UPC++ DepSpawn
(Institute of Electrical and Electronics Engineers, 2019-06-01)
[Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...
A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)
(Institute of Electrical and Electronics Engineers, 2019)
[Abstract]: Many problems of industrial and scientific interest require the solving of tridiagonal linear systems. This paper presents several implementations for the parallel solving of large tridiagonal systems on ...
Parallelization of shallow water simulations on current multi-threaded systems
(SAGE Journals, 2013-11)
[Abstract]: In this work, several parallel implementations of a numerical model of pollutant transport on a shallow water system are presented. These parallel implementations are developed in two phases. First, the sequential ...
Numerical Simulation of Pollutant Transport in a Shallow-Water System on the Cell Heterogeneous Processor
(Springer, 2013)
[Abstract] This paper presents an implementation, optimized for the Cell processor, of a finite volume numerical scheme for 2D shallow-water systems with pollutant transport. A description of the special architecture and ...
A multi-GPU shallow-water simulation with transport of contaminants
(Wiley, 2012)
[Abstract] This work presents cost-effective multi-graphics processing unit (GPU) parallel implementations of a finite-volume numerical scheme for solving pollutant transport problems in bidimensional domains. The fluid ...
A Parallel Skeleton for Divide-and-conquer Unbalanced and Deep Problems
(Springer Nature, 2021)
[Abstract] The Divide-and-conquer (D&C) pattern appears in a large number of problems and is highly suitable to exploit parallelism. This has led to much research on its easy and efficient application both in shared and ...
A new thread-level speculative automatic parallelization model and library based on duplicate code execution
(Springer Nature, 2024-03-11)
Loop-efficient automatic parallelization has become increasingly relevant due to the growing number of cores in current processors and the programming effort needed to parallelize codes in these systems efficiently. However, ...
A Highly Optimized Skeleton for Unbalanced and Deep Divide-And-Conquer Algorithms on Multi-Core Clusters
(Springer, 2022)
[Abstract] Efficiently implementing the divide-and-conquer pattern of parallelism in distributed memory systems is very relevant, given its ubiquity, and difficult, given its recursive nature and the need to exchange tasks ...
Automatic mapping of parallel applications on multicore architectures using the Servet benchmark suite
(Pergamon Press, 2012-03)
[Abstract] Servet is a suite of benchmarks focused on detecting a set of parameters with high influence on the overall performance of multicore systems. These parameters can be used for autotuning codes to increase their ...