Browsing GI-GAC - Congresos, conferencias, etc. by Issue Date
Now showing items 1-20 of 47
-
Parallel Sparse Modified Gram-Schmidt QR Decomposition
(Springer, 1996)[Abstract] We present a parallel computational method for the QR decomposition with column pivoting of a sparse matrix by means of Modified Gram-Schmidt orthogonalization. Nonzero elements of the matrix M to be decomposed ... -
Sparse Householder QR factorization on a mesh
(IEEE Computer Society, 1996-01)[Abstract] We analyze the parallelization of QR factorization by means of Householder transformations. This parallelization is carried out on a machine with a mesh topology (a 2-D torus to be more precise). We use a cyclic ... -
Sparse Givens QR Factorization on a Multiprocessor
(IEEE Computer Society, 1996-06-01)[Abstract] We present a parallel algorithm for the QR factorization with column pivoting of a sparse matrix by means of Givens rotations. Nonzero elements of the matrix M to be decomposed are stored in a one dimensional ... -
A PVM Based Library for Sparse Matrix Factorizations
(Springer, 1998)[Abstract] We present 3LM, a C Linked List Management Library for parallel sparse factorizations on a PVM environment which takes into account the fill-in, an important drawback of sparse computations. It is restricted to ... -
HPF-2 Support for Dynamic Sparse Computations
(Springer, 1998)[Abstract] There is a class of sparse matrix computations, such as direct solvers of systems of linear equations, that change the fill-in (nonzero entries) of the coefficient matrix, and involve row and column operations ... -
Characterization of message-passing overhead on the AP3000 multicomputer
(IEEE, 2001-09)[Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ... -
Exploiting locality in the run-time parallelization of irregular loops
(C R C Press, LLC, 2002-12-10)[Abstract] The goal of this work is the efficient parallel execution of loops with indirect array accesses, in order to be embedded in a parallelizing compiler framework. In this kind of loop pattern, dependences can not ... -
An Inspector-Executor Algorithm for Irregular Assignment Parallelization
(Springer, 2004)[Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is ... -
Performance analysis of Java message-passing libraries on fast Ethernet, Myrinet and SCI clusters
(IEEE Computer Society, 2004-01-08)[Abstract] The use of Java for parallel programming on clusters according to the message-passing paradigm is an attractive choice. In this case, the overall application performance will largely depend on the performance ... -
Compiler support for parallel code generation through kernel recognition
(IEEE Computer Society, 2004-06-07)[Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ... -
Non-blocking Java Communications Support on Clusters
(Springer, 2006)[Abstract] This paper presents communication strategies for supporting efficient non-blocking Java communication on clusters. The communication performance is critical for the overall cluster performance. It is possible ... -
Towards Low-Latency Model-Oriented Distributed Systems Management
(Springer, 2007)[Abstract] Windows and Unix systems have been traditionally very different with regard to configuration storage and management. In this paper we have adapted our CIM-based model-driven management framework, AdCIM, to ... -
Program Behavior Characterization Through Advanced Kernel Recognition
(Springer, 2007)[Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program ... -
Efficient Java Communication Protocols on High-speed Cluster Interconnects
(IEEE Computer Society, 2007-02-26)[Abstract] This paper presents communication strategies for achieving efficient parallel and distributed Java applications on clusters with high-speed interconnects. Communication performance is critical for the overall ... -
High Performance Java Remote Method Invocation for Parallel Computing on Clusters
(IEEE Computer Society, 2007-11-12)[Abstract] This paper presents a more efficient Java remote method invocation (RMI) implementation for high-speed clusters. The use of Java for parallel programming on clusters is limited by the lack of efficient communication ... -
Integrating the common information model with MDS4
(IEEE Computer Society, 2008-10-31)[Abstract] The management and monitoring of static and dynamic resources is a key issue in grid environments. Information models are an abstract representation of software and hardware aspects of these resources, a common ... -
Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures
(Springer, 2009)[Abstract] The current trend to multicore architectures underscores the need of parallelism. While new languages and alternatives for supporting more efficiently these systems are proposed, MPI faces this new challenge. ... -
Ontological Con guration Management for Wireless Mesh Routers
(Springer, 2009)[Abstract] Wireless mesh networks (WMNs) are a category of wireless networks that are self-organized, robust and which offer more flexible client coverage with less equipment requirements than wired networks. In WMNs, mesh ... -
NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java
(IEEE Computer Society, 2009-05-08)[Abstract] Java is a valuable and emerging alternative for the development of parallel applications, thanks to the availability of several Java message-passing libraries and its full multithreading support. The combination ... -
Performance Evaluation of Unified Parallel C Collective Communications
(IEEE Computer Society, 2009-07-17)[Abstract] Unified Parallel C (UPC) is an extension of ANSI C designed for parallel programming. UPC collective primitives, which are part of the UPC standard, increase programming productivity while reducing the communication ...