Listar Grupo de Arquitectura de Computadores (GAC) por título
Mostrando ítems 52-71 de 237
-
Characterization of message-passing overhead on the AP3000 multicomputer
(IEEE, 2001-09)[Abstract] The performance of the communication primitives of parallel computers is critical for the overall system performance. The characterization of the communication overhead is very important to estimate the global ... -
Clupiter: a Raspberry Pi mini-supercomputer for educational purposes
(Institute of Electrical and Electronics Engineers, 2024)[Abstract]: The main objective of this work is to bring supercomputing and parallel processing closer to non-specialized audiences by building a Raspberry Pi cluster, called Clupiter, which emulates the operation of a ... -
Communication avoiding and overlapping for numerical linear algebra
(IEEE Computer Society, 2013-02-25)[Abstract] To efficiently scale dense linear algebra problems to future exascale systems, communication cost must be avoided or overlapped. Communication-avoiding 2.5D algorithms improve scalability by reducing inter-processor ... -
Comparison of Hardwired and Microprogrammed Statechart Implementations
(MDPI, 2020)[Abstract]: In scientific facilities such as particle accelerators, fast and jitter-free synchronization is required in order to trigger a large number of actuators at the right time in a variety of situations. The behaviour ... -
Compiler support for parallel code generation through kernel recognition
(IEEE Computer Society, 2004-06-07)[Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ... -
Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience
(Springer New York LLC, 2013)[Abstract] With the evolution of high-performance computing, parallel applications have developed an increasing necessity for fault tolerance, most commonly provided by checkpoint and restart techniques. Checkpointing tools ... -
Concept Drift Detection and Adaptation for Federated and Continual Learning
(Springer, 2021)[Abstract] Smart devices, such as smartphones, wearables, robots, and others, can collect vast amounts of data from their environment. This data is suitable for training machine learning models, which can significantly ... -
CPPC: a compiler‐assisted tool for portable checkpointing of message‐passing applications
(John Wiley & Sons Ltd., 2010-11-19)[Abstract] With the evolution of high‐performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart necessities. Whether due to a failure in the ... -
CUDA acceleration of MI-based feature selection methods
(Elsevier, 2024-08)[Abstract]: Feature selection algorithms are necessary nowadays for machine learning as they are capable of removing irrelevant and redundant information to reduce the dimensionality of the data and improve the quality of ... -
CUDA-JMI: Acceleration of feature selection on heterogeneous systems
(Elsevier, 2020-01)[Abstract]: Feature selection is a crucial step nowadays in machine learning and data analytics to remove irrelevant and redundant characteristics and thus to provide fast and reliable analyses. Many research works have ... -
Design and Implementation of an extended collectives library for unified Parallel C
(Springer New York LLC, 2013)[Abstract] Unified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while it ... -
Design and Implementation of MapReduce using the PGAS Programming Model with UPC
(IEEE Computer Society, 2012-01-03)[Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ... -
Design of efficient Java message-passing collectives on multi-core clusters
(Springer New York LLC, 2011-02)[Abstract] This paper presents a scalable and efficient Message-Passing in Java (MPJ) collective communication library for parallel computing on multi-core architectures. The continuous increase in the number of cores per ... -
Design of Scalable Java Communication Middleware for Multi-Core Systems
(Oxford University Press, 2013-02-01)[Abstract] This paper presents smdev, a shared memory communication middleware for multi-core systems. smdev provides a simple and powerful messaging application program interface that is able to exploit the underlying ... -
Design of scalable Java message-passing communications over InfiniBand
(Springer New York LLC, 2012-07)[Abstract] This paper presents ibvdev a scalable and efficient low-level Java message-passing communication device over InfiniBand. The continuous increase in the number of cores per processor underscores the need for ... -
Developing adaptive multi-device applications with the Heterogeneous Programming Library
(Springer, 2015)[Abstract] The usage of heterogeneous devices presents two main problems. One is their complex programming, a problem that grows when multiple devices are used. The second issue is that even if the codes for these devices ... -
Device level communication libraries for high‐performance computing in Java
(John Wiley & Sons Ltd., 2011-12-25)[Abstract] Since its release, the Java programming language has attracted considerable attention from the high‐performance computing (HPC) community because of its portability, high programming productivity, and built‐in ... -
Does the choice of nucleotide substitution models matter topologically?
(BioMed Central Ltd., 2016-03-24)[Abstract] Background: In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all ... -
Easy Dataflow Programming in Clusters with UPC++ DepSpawn
(Institute of Electrical and Electronics Engineers, 2019-06-01)[Abstract]: The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ... -
Efficient Culling Techniques for Interactive Deformable NURBS Surfaces on GPU
(SciTePress, 2016-02)[Abstrtact] InfoValue: NURBS (Non-uniform rational B-splines) surfaces are the standard freeform representation in Computer-Aided Design (CAD) applications. Rendering NURBS surfaces accurately while they are interactively ...