Buscar
Mostrando ítems 1-10 de 13
Characterization of message-passing overhead on the AP3000 multicomputer
(IEEE, 2001-09)
[Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...
Developing adaptive multi-device applications with the Heterogeneous Programming Library
(Springer, 2015)
[Abstract] The usage of heterogeneous devices presents two main problems. One is their complex programming, a problem that grows when multiple devices are used. The second issue is that even if the codes for these devices ...
A framework for argument-based task synchronization with automatic detection of dependencies
(Elsevier, 2013)
[Abstract] Synchronization in parallel applications can be achieved either implicitly or explicitly. Implicit synchronization is typical of programming environments that provide predefined, and often simple, patterns of ...
An Algorithm Template for Domain-Based Parallel Irregular Algorithms
(Springer, 2013)
[Abstract] The parallelization of irregular algorithms has not been as widely studied as the one of regular codes. In particular, while there are many proposals of parallel skeletons and libraries very well suited to regular ...
High Productivity Multi-device Exploitation with the Heterogeneous Programming Library
(Elsevier, 2016)
[Abstract] Heterogeneous devices require much more work from programmers than traditional CPUs, particularly when there are several of them, as each one has its own memory space. Multidevice applications require to distribute ...
Exploiting Heterogeneous Parallelism With the Heterogeneous Programming Library
(Elsevier, 2013)
[Abstract] While recognition of the advantages of heterogeneous computing is steadily growing, the issues of programmability and portability hinder its exploitation. The introduction of the OpenCL standard was a major step ...
Easy Dataflow Programming in Clusters with UPC++ DepSpawn
(Institute of Electrical and Electronics Engineers, 2019-06-01)
[Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...
Servet: A Benchmark Suite for Autotuning on Multicore Clusters
(Institute of Electrical and Electronics Engineers, 2010-05-24)
[Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...
NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java
(IEEE Computer Society, 2009-05-08)
[Abstract] Java is a valuable and emerging alternative for the development of parallel applications, thanks to the availability of several Java message-passing libraries and its full multithreading support. The combination ...
Efficient Java Communication Protocols on High-speed Cluster Interconnects
(IEEE Computer Society, 2007-02-26)
[Abstract] This paper presents communication strategies for achieving efficient parallel and distributed Java applications on clusters with high-speed interconnects. Communication performance is critical for the overall ...