Skip navigation
  •  Home
  • UDC 
    • Getting started
    • RUC Policies
    • FAQ
    • FAQ on Copyright
    • More information at INFOguias UDC
  • Browse 
    • Communities
    • Browse by:
    • Issue Date
    • Author
    • Title
    • Subject
  • Help
    • español
    • Gallegan
    • English
  • Login
  •  English 
    • Español
    • Galego
    • English
  
View Item 
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Exploiting locality in the run-time parallelization of irregular loops

Thumbnail
View/Open
M.J.Martín_2003_Exploiting _Locality_in_the_Run-Time_Parallelization_of_Irregular_Loops.pdf (233.9Kb)
Use this link to cite
http://hdl.handle.net/2183/23357
Collections
  • Investigación (FIC) [1685]
Metadata
Show full item record
Title
Exploiting locality in the run-time parallelization of irregular loops
Author(s)
Martín, María J.
Singh, David E.
Touriño, Juan
Rivera, Francisco F.
Date
2002-12-10
Citation
M. J. Martin, D. E. Singh, J. Tourino and F. F. Rivera, "Exploiting locality in the run-time parallelization of irregular loops," Proceedings International Conference on Parallel Processing, Vancouver, BC, Canada, 2002, pp. 27-34.
Abstract
[Abstract] The goal of this work is the efficient parallel execution of loops with indirect array accesses, in order to be embedded in a parallelizing compiler framework. In this kind of loop pattern, dependences can not always be determined at compile-time as, in many cases, they involve input data that are only known at run-time and/or the access pattern is too complex to be analyzed In this paper we propose runtime strategies for the parallelization of these loops. Our approaches focus not only on extracting parallelism among iterations of the loop, but also on exploiting data access locality to improve memory hierarchy behavior and, thus, the overall program speedup. Two strategies are proposed one based on graph partitioning techniques and other based on a block-cyclic distribution. Experimental results show that both strategies are complementary and the choice of the best alternative depends on some features of the loop pattern.
Keywords
Runtime
Parallel processing
Data mining
Computer science
Pattern analysis
Sparse matrices
Fluid flow
Fluid dynamics
Finite element method
Degradation
 
Description
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated version is available online at: http://dx.doi.org/10.1109/ICPP.2002.1040856
Editor version
http://dx.doi.org/10.1109/ICPP.2002.1040856
ISSN
0190-3918
ISBN
0-7695-1677-7

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic DegreeThis CollectionBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic Degree

My Account

LoginRegister

Statistics

View Usage Statistics
Sherpa
OpenArchives
OAIster
Scholar Google
UNIVERSIDADE DA CORUÑA. Servizo de Biblioteca.    DSpace Software Copyright © 2002-2013 Duraspace - Send Feedback