Skip navigation
  •  Inicio
  • UDC 
    • Cómo depositar
    • Políticas do RUC
    • FAQ
    • Dereitos de Autor
    • Máis información en INFOguías UDC
  • Percorrer 
    • Comunidades
    • Buscar por:
    • Data de publicación
    • Autor
    • Título
    • Materia
  • Axuda
    • español
    • Gallegan
    • English
  • Acceder
  •  Galego 
    • Español
    • Galego
    • English
  
Ver ítem 
  •   RUC
  • Facultade de Informática
  • Investigación (FIC)
  • Ver ítem
  •   RUC
  • Facultade de Informática
  • Investigación (FIC)
  • Ver ítem
JavaScript is disabled for your browser. Some features of this site may not work without it.

Optimizing Coherence Traffic in Manycore Processors Using Closed-Form Caching/Home Agent Mappings

Thumbnail
Ver/abrir
S.Kommrusch_2021_Optimizing_Coherence_Traffic_in_Manycore.pdf (1.767Mb)
Use este enlace para citar
http://hdl.handle.net/2183/27758
Atribución 3.0 España
A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución 3.0 España
Coleccións
  • Investigación (FIC) [1678]
Metadatos
Mostrar o rexistro completo do ítem
Título
Optimizing Coherence Traffic in Manycore Processors Using Closed-Form Caching/Home Agent Mappings
Autor(es)
Kommrusch, Steve
Horro, Marcos
Pouchet, Louis-Noël
Rodríguez, Gabriel
Touriño, Juan
Data
2021-02-09
Cita bibliográfica
S. Kommrusch, M. Horro, L. -N. Pouchet, G. Rodríguez and J. Touriño, "Optimizing Coherence Traffic in Manycore Processors Using Closed-Form Caching/Home Agent Mappings," in IEEE Access, vol. 9, pp. 28930-28945, 2021, doi: 10.1109/ACCESS.2021.3058280.
Resumo
[Abstract] Manycore processors feature a high number of general-purpose cores designed to work in a multithreaded fashion. Recent manycore processors are kept coherent using scalable distributed directories. A paramount example is the Intel Mesh interconnect, which consists of a network-on-chip interconnecting “tiles”, each of which contains computation cores, local caches, and coherence masters. The distributed coherence subsystem must be queried for every out-of-tile access, imposing an overhead on memory latency. This paper studies the physical layout of an Intel Knights Landing processor, with a particular focus on the coherence subsystem, and uncovers the pseudo-random mapping function of physical memory blocks across the pieces of the distributed directory. Leveraging this knowledge, candidate optimizations to improve memory latency through the minimization of coherence traffic are studied. Although these optimizations do improve memory throughput, ultimately this does not translate into performance gains due to inherent overheads stemming from the computational complexity of the mapping functions.
Palabras chave
Coherence
Layout
Optimization
Manycore processors
Dynamic scheduling
Task analysis
Standards
 
Versión do editor
https://doi.org/10.1109/ACCESS.2021.3058280
Dereitos
Atribución 3.0 España
ISSN
2169-3536

Listar

Todo RUCComunidades e colecciónsPor data de publicaciónAutoresTítulosMateriasGrupo de InvestigaciónTitulaciónEsta colecciónPor data de publicaciónAutoresTítulosMateriasGrupo de InvestigaciónTitulación

A miña conta

AccederRexistro

Estatísticas

Ver Estatísticas de uso
Sherpa
OpenArchives
OAIster
Scholar Google
UNIVERSIDADE DA CORUÑA. Servizo de Biblioteca.    DSpace Software Copyright © 2002-2013 Duraspace - Suxestións