Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM routine on Ampere GPUs
(Institute of Electrical and Electronics Engineers, 2022)
[Abstract] The Deep Learning (DL) community found in pruning techniques a good way to reduce the models' resource and energy consumption. These techniques lead to smaller sparse models, but sparse computations in GPUs ...
Efficient high-precision integer multiplication on the GPU
(SAGE Journals, 2022-03)
[Abstract] The multiplication of large integers, which has many applications in computer science, is an operation that can be expressed as a polynomial multiplication followed by a carry normalization. This work develops ...
A simulated annealing algorithm for zoning in planning using parallel computing
(Pergamon Press, 2016)
[Abstract] There is an increasing demand for tools that support land use planning processes, particularly the design of zoning maps, which is one of the most complex tasks in the field. In this task, different land use ...
Automated and accurate cache behavior analysis for codes with irregular access patterns
(John Wiley & Sons Ltd., 2007-04-03)
[Abstract] The memory hierarchy plays an essential role in the performance of current computers, so good analysis tools that help in predicting and understanding its behavior are required. Analytical modeling is the ideal ...
Using Artificial Vision Techniques for Individual Player Tracking in Sport Events
(MDPI AG, 2019-07-31)
[Abstract] We introduce a hybrid approach that can track an individual football player in a video sequence. This solution achieves a good balance between speed and accuracy, combining traditional object tracking techniques ...
Rendering of Bézier Surfaces on Handheld Devices
(Vaclav Skala - Union Agency, 2013)
[Abstract] Bézier surfaces have been widely employed in the designing of complex scenes with high-quality results. Nevertheless, parametric surfaces cannot be directly rendered in the current GPUs of modern handheld devices. ...
High-Speed FPGA Architecture for CABAC Decoding Acceleration in H.264/AVC Standard
(Springer New York LLC, 2013-08)
[Abstract] Video encoding and decoding are computing intensive applications that require high performance processors or dedicated hardware. Video decoding offers a high parallel processing potential that may be exploited. ...
Towards Low-Latency Model-Oriented Distributed Systems Management
(Springer, 2007)
[Abstract] Windows and Unix systems have been traditionally very different with regard to configuration storage and management. In this paper we have adapted our CIM-based model-driven management framework, AdCIM, to ...
Ontological Configuration Management for Wireless Mesh Routers
(Springer, 2009)
[Abstract] Wireless mesh networks (WMNs) are a category of wireless networks that are self-organized, robust and which offer more flexible client coverage with less equipment requirements than wired networks. In WMNs, mesh ...
Program Behavior Characterization Through Advanced Kernel Recognition
(Springer, 2007)
[Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program ...