Performance analysis of Java message-passing libraries on fast Ethernet, Myrinet and SCI clusters
(IEEE Computer Society, 2004-01-08)
[Abstract] The use of Java for parallel programming on clusters according to the message-passing paradigm is an attractive choice. In this case, the overall application performance will largely depend on the performance ...
Ontological Configuration Management for Wireless Mesh Routers
(Springer, 2009)
[Abstract] Wireless mesh networks (WMNs) are a category of wireless networks that are self-organized, robust and which offer more flexible client coverage with less equipment requirements than wired networks. In WMNs, mesh ...
Characterization of message-passing overhead on the AP3000 multicomputer
(IEEE, 2001-09)
[Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ...
Program Behavior Characterization Through Advanced Kernel Recognition
(Springer, 2007)
[Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program ...
Towards Low-Latency Model-Oriented Distributed Systems Management
(Springer, 2007)
[Abstract] Windows and Unix systems have been traditionally very different with regard to configuration storage and management. In this paper we have adapted our CIM-based model-driven management framework, AdCIM, to ...
Sparse Givens QR Factorization on a Multiprocessor
(IEEE Computer Society, 1996-06-01)
[Abstract] We present a parallel algorithm for the QR factorization with column pivoting of a sparse matrix by means of Givens rotations. Nonzero elements of the matrix M to be decomposed are stored in a one dimensional ...
An Inspector-Executor Algorithm for Irregular Assignment Parallelization
(Springer, 2004)
[Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is ...
Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience
(Springer New York LLC, 2013)
[Abstract] With the evolution of high-performance computing, parallel applications have developed an increasing necessity for fault tolerance, most commonly provided by checkpoint and restart techniques. Checkpointing tools ...
UPCBLAS: a library for parallel matrix computations in Unified Parallel C
(John Wiley & Sons Ltd., 2012-09-25)
[Abstract] The popularity of Partitioned Global Address Space (PGAS) languages has increased during the last years thanks to their high programmability and performance through an efficient exploitation of data locality, ...
Evaluation of messaging middleware for high-performance cloud computing
(Springer UK, 2013-12)
[Abstract] Cloud computing is posing several challenges, such as security, fault tolerance, access interface singularity, and network constraints, both in terms of latency and bandwidth. In this scenario, the performance ...