Now showing items 1-6 of 6
Compiler support for parallel code generation through kernel recognition
(IEEE Computer Society, 2004-06-07)
[Abstract] Summary form only given. The automatic parallelization of loops that contain complex computations is still a challenge for current parallelizing compilers. The main limitations are related to the analysis of ...
Efficient parallel numerical solver for the elastohydrodynamic Reynolds–Hertz problem
(Elsevier BV * North-Holland, 2001-12-01)
[Abstract] This work presents a parallel version of a complex numerical algorithm for solving an elastohydrodynamic piezoviscous lubrication problem studied in tribology. The numerical algorithm combines regula falsi, fixed ...
Automated and accurate cache behavior analysis for codes with irregular access patterns
(John Wiley & Sons Ltd., 2007-04-03)
[Abstract] The memory hierarchy plays an essential role in the performance of current computers, so good analysis tools that help in predicting and understanding its behavior are required. Analytical modeling is the ideal ...
XARK: an extensible framework for automatic recognition of computational kernels
(Association for Computing Machinery, 2008-10)
[Abstract] The recognition of program constructs that are frequently used by software developers is a powerful mechanism for optimizing and parallelizing compilers to improve the performance of the object code. The development ...
Program Behavior Characterization Through Advanced Kernel Recognition
[Abstract] Understanding program behavior is at the foundation of program optimization. Techniques for automatic recognition of program constructs (from now on, computational kernels) characterize the behavior of program ...
An Inspector-Executor Algorithm for Irregular Assignment Parallelization
[Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is ...