Mostrar o rexistro simple do ítem
OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA
dc.contributor.author | López Castro, Roberto | |
dc.contributor.author | Andrade, Diego | |
dc.contributor.author | Fraguela, Basilio B. | |
dc.date.accessioned | 2021-09-09T16:34:55Z | |
dc.date.available | 2021-09-09T16:34:55Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Castro, R.L.; Andrade, D.; Fraguela, B.B. OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA. Mathematics 2021, 9, 2033. https://doi.org/10.3390/math9172033 | es_ES |
dc.identifier.uri | http://hdl.handle.net/2183/28443 | |
dc.description.abstract | [Abstract] Improving the performance of the convolution operation has become a key target for High Performance Computing (HPC) developers due to its prevalence in deep learning applied mainly to video processing. The improvement is being pushed by algorithmic and implementation innovations. Algorithmically, the convolution can be solved as it is mathematically enunciated, but other methods allow to transform it into a Fast Fourier Transform (FFT) or a GEneral Matrix Multiplication (GEMM). In this latter group, the Winograd algorithm is a state-of-the-art variant that is specially suitable for smaller convolutions. In this paper, we present openCNN, an optimized CUDA C++ implementation of the Winograd convolution algorithm. Our approach achieves speedups of up to 1.76× on Turing RTX 2080Ti and up to 1.85× on Ampere RTX 3090 with respect to Winograd convolution in cuDNN 8.2.0. OpenCNN is released as open-source software. | es_ES |
dc.description.sponsorship | This research was supported by the Ministry of Science and Innovation of Spain (PID2019-104184RB-I00, AEI/FEDER/EU, 10.13039/501100011033) and the predoctoral grant of Roberto L. Castro (FPU19/03974). and by the Xunta de Galicia co-founded by the European Regional Development Fund (ERDF) under the Consolidation Programme of Competitive Reference Groups (ED431C 2021/30). CITIC, Centro de Investigación de Galicia ref. ED431G 2019/01, receives financial support from Consellería de Educación, Universidade e Formación Profesional, Xunta de Galicia, through the ERDF (80%) and Secretaría Xeral de Universidades (20%) | es_ES |
dc.description.sponsorship | Xunta de Galicia; ED431C 2021/30 | es_ES |
dc.description.sponsorship | Xunta de Galicia; ED431G 2019/01 | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | MDPI | es_ES |
dc.relation | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-104184RB-I00/ES/DESAFIOS ACTUALES EN HPC: ARQUITECTURAS, SOFTWARE Y APLICACIONES | |
dc.relation | info:eu-repo/grantAgreement//Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/FPU19%2F03974/ES/ | |
dc.relation.uri | https://doi.org/10.3390/math9172033 | es_ES |
dc.rights | Atribución 4.0 Internacional | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Deep learning | es_ES |
dc.subject | Convolution | es_ES |
dc.subject | Winograd | es_ES |
dc.subject | CUDA | es_ES |
dc.title | OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.access | info:eu-repo/semantics/openAccess | es_ES |
UDC.journalTitle | Mathematics | es_ES |
UDC.volume | 9 | es_ES |
UDC.startPage | 2033 | es_ES |
dc.identifier.doi | 10.3390/math9172033 |
Ficheiros no ítem
Este ítem aparece na(s) seguinte(s) colección(s)
-
GI-GAC - Artigos [192]