On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Enxeñaría de Computadores | es_ES |
| UDC.endPage | 77038 | es_ES |
| UDC.grupoInv | Grupo de Tecnoloxía Electrónica e Comunicacións (GTEC) | es_ES |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | es_ES |
| UDC.journalTitle | IEEE Access | es_ES |
| UDC.startPage | 77017 | es_ES |
| UDC.volume | 13 | es_ES |
| dc.contributor.author | Valladares Poncela, Antón | |
| dc.contributor.author | Fraga-Lamas, Paula | |
| dc.contributor.author | Fernández-Caramés, Tiago M. | |
| dc.date.accessioned | 2025-05-13T14:16:44Z | |
| dc.date.available | 2025-05-13T14:16:44Z | |
| dc.date.issued | 2025-04 | |
| dc.description.abstract | [Abstract]: As the Metaverse and Mixed Reality (MR) technologies continue to evolve, enabling natural and intuitive user interfaces is crucial. However, supporting low-resource languages in these advanced systems presents unique challenges. This article explores the development and deployment of an on-device Automatic Speech Recognition (ASR) system for Galician, a low-resource language spoken by less than 3 million people, implemented on the Microsoft HoloLens 2 MR glasses. The system prioritizes data privacy and security by eliminating the need for Internet connectivity or external processing. Key implementation choices, including software and libraries, are detailed, along with optimization strategies for minimizing latency. Performance evaluations, taking into account noise-simulated environments, demonstrate the high accuracy and low latency of the system, proving its effectiveness as an on-device ASR system for current and future Metaverse applications. In order to demonstrate the effectiveness of the developed system, it has been incorporated in an electrical outfitting application for Navantia, one of the largest shipbuilding companies in the world, illustrating its practical utility in an industrial scenario like a shipyard. The results obtained show a Character Error Rate (CER) below 6% and a latency of under 3 seconds using an ARM64 quantized model, which validates the effectiveness of the system for real-time voice control in industrial MR environments. | es_ES |
| dc.description.sponsorship | This work has been supported by Centro Mixto de Investigación UDC-NAVANTIA (IN853C 2022/01), funded by GAIN (Xunta de Galicia) and ERDF Galicia 2021-2027. Funding for open access charge: Universidade da Coruña/CISUG. | es_ES |
| dc.description.sponsorship | Xunta de Galicia; IN853C 2022/01 | es_ES |
| dc.description.sponsorship | Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG | es_ES |
| dc.identifier.citation | A. Valladares-Poncela, P. Fraga-Lamas and T. M. Fernández-Caramés, "On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician," in IEEE Access, vol. 13, pp. 77017-77038, 2025, doi: 10.1109/ACCESS.2025.3564137 | es_ES |
| dc.identifier.doi | 10.1109/ACCESS.2025.3564137 | |
| dc.identifier.issn | 2169-3536 | |
| dc.identifier.uri | http://hdl.handle.net/2183/41982 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Institute of Electrical and Electronics Engineers | es_ES |
| dc.relation.uri | https://doi.org/10.1109/ACCESS.2025.3564137 | es_ES |
| dc.rights | Atribución 4.0 Internacional | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
| dc.subject | Automatic speech recognition (ASR) | es_ES |
| dc.subject | Extended reality | es_ES |
| dc.subject | Industrial metaverse | es_ES |
| dc.subject | IoT | es_ES |
| dc.subject | Microsoft HoloLens 2 | es_ES |
| dc.subject | Mixed Reality | es_ES |
| dc.title | On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | VoR | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | caa923d2-cf88-405e-9025-759d06cf3799 | |
| relation.isAuthorOfPublication | 79dbfabd-7261-41ff-9667-2f774d5f341e | |
| relation.isAuthorOfPublication.latestForDiscovery | caa923d2-cf88-405e-9025-759d06cf3799 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Valladares_Poncela_Anton_2025_On-Device_Automatic_Speech_Recognition_for_Low-Resource_Languages_in_Mixed_Reality_Industrial_Metaverse_Applications.pdf
- Size:
- 2.42 MB
- Format:
- Adobe Portable Document Format
- Description:

