Spatial–temporal feature-based End-to-end Fourier network for 3D sign language recognition
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | es_ES |
| UDC.endPage | 15 | es_ES |
| UDC.grupoInv | Laboratorio de Investigación e Desenvolvemento en Intelixencia Artificial (LIDIA) | es_ES |
| UDC.issue | Article 123258 | es_ES |
| UDC.journalTitle | Expert Systems with Applications | es_ES |
| UDC.startPage | 1 | es_ES |
| UDC.volume | 248 | es_ES |
| dc.contributor.author | Abdullahi, Sunusi Bala | |
| dc.contributor.author | Chamnongthai, Kosin | |
| dc.contributor.author | Bolón-Canedo, Verónica | |
| dc.contributor.author | Cancela, Brais | |
| dc.date.accessioned | 2024-11-19T19:24:54Z | |
| dc.date.embargoEndDate | 2026-08-15 | es_ES |
| dc.date.embargoLift | 2026-08-15 | |
| dc.date.issued | 2024-08-15 | |
| dc.description | This is the Accepted Manuscript. This version of the article has been accepted for publication in: Expert Systems with Applications, 248, 123258. The Version of Record is available online at https://doi.org/10.1016/j.eswa.2024.123258. | es_ES |
| dc.description.abstract | [Abstract]: Most dynamic sign word misclassifications are caused by redundant spatial–temporal (SPT) feature pruning that lacks language semantic and temporal dependencies. SPT feature recognition is one of the important aspects for the evaluation of the misclassification of dynamic sign words. The redundant pruning of SPT feature space influences the language model of sign confusion, model complexity, and SPT feature similarity. The purpose of this article is to develop a new multi-scale SPT feature-based dynamic sign word recognition approach via a low-cost feature selection method (FS) and End-to-end Fourier convolution neural network (EFCNN). Instead of a sensor fusion technique for obtaining frame position alignment, in the EFCNN, new 3D frame position and coordinates are determined using a pixel weighting and alignment function of the first and succeeding 25 spatial intensities of the 3D video changes across hand motion. The new spatial weight and the original spatial coordinates are fused and truncated in the Fourier domain. We generate the temporal dependence of the fused features. A feature selection known as the FS-EFCNN is introduced to select compact features with a preserved language meaning. Five state-of-the-art feature selection methods, namely Infinite FS (InFS), Relief FS, Fisher, MIM, ILFS, and ensemble FS-EFCNN were deployed to guide and optimize the learning performance of EFCNN. The experimental result analysis highlighted the improved results of the FS-EFCNN method with the best accuracy of 99.86%, 99.89%, and 90.69% on 3D American Sign Language, British Sign Language, and Greek Sign Language data sets, respectively. | es_ES |
| dc.description.sponsorship | This research has been financially supported in part by the Spanish Ministerio de Ciencia e Innovación MCIN/AEI/10.13039/501100011033 and ”NextGenerationEU”/PRTR under Grants [PID2019-109238GB-C22; PID2021-128045OA-I00; TED2021-130599A-I00], and by the Xunta de Galicia (ED431C 2022/44) with the European Union ERDF funds. CITIC, as a Research Center of the University System of Galicia, is funded by Consellería de Educación, Universidade e Formación Profesional of the Xunta de Galicia, Spain through the European Regional Development Fund (ERDF) and the Secretaría Xeral de Universidades (Ref. ED431G 2019/01). This research is also supported by King Mongkut’s University of Technology Thonburi’s Postdoctoral Fellowship Under Research Project ID 27180. | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2022/44 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2019/01 | es_ES |
| dc.description.sponsorship | Thailand. King Mongkut's University of Technology Thonburi; 27180 | es_ES |
| dc.identifier.citation | Abdullahi, S. B., Chamnongthai, K., Bolon-Canedo, V., & Cancela, B. (2024). Spatial–temporal feature-based End-to-end Fourier network for 3D sign language recognition. Expert Systems with Applications, 248, 123258. https://doi.org/10.1016/j.eswa.2024.123258 | es_ES |
| dc.identifier.doi | 10.1016/j.eswa.2024.123258 | |
| dc.identifier.issn | 0957-4174 | |
| dc.identifier.issn | 1873-6793 | |
| dc.identifier.uri | http://hdl.handle.net/2183/40192 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Elsevier | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-109238GB-C22/ES/APRENDIZAJE AUTOMATICO ESCALABLE Y EXPLICABLE | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/MICINN/Plan Estatal de Investigación Científica, Técnica y de Innovación 2021-2023/PID2021-128045OA-I00/ES/APRENDIZAJE PROFUNDO ÉTICO | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2024/TED2021-130599A-I00/ES/ALGORITMOS DE SELECCIÓN DE CARACTERÍSTICAS VERDES Y RÁPIDOS | es_ES |
| dc.relation.uri | https://doi.org/10.1016/j.eswa.2024.123258. | es_ES |
| dc.rights | Atribución-NoComercial-SinDerivadas 4.0 Internacional | es_ES |
| dc.rights | © 2024. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/. | es_ES |
| dc.rights.accessRights | embargoed access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
| dc.subject | End-to-end deep learning network | es_ES |
| dc.subject | Feature selection | es_ES |
| dc.subject | Fourier convolution | es_ES |
| dc.subject | Hand gestures | es_ES |
| dc.subject | Natural language processing | es_ES |
| dc.subject | Sign language recognition | es_ES |
| dc.subject | Spatial–temporal information | es_ES |
| dc.title | Spatial–temporal feature-based End-to-end Fourier network for 3D sign language recognition | es_ES |
| dc.type | journal article | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | c114dccd-76e4-4959-ba6b-7c7c055289b1 | |
| relation.isAuthorOfPublication | ba91aca1-bdb4-4be5-b686-463937924910 | |
| relation.isAuthorOfPublication.latestForDiscovery | c114dccd-76e4-4959-ba6b-7c7c055289b1 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Bolon_Canedo_Veronica_2024_Spatial–temporal_feature-based_End-to-end_Fourier_network.pdf
- Size:
- 1.96 MB
- Format:
- Adobe Portable Document Format
- Description:

