Hybrid LLM and Sentence Transformer Evaluation for ITS Programming Exercises

UDC.coleccionInvestigación
UDC.conferenceTitleICETC 2025
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.endPage225
UDC.grupoInvLaboratorio de Investigación e Desenvolvemento en Intelixencia Artificial (LIDIA)
UDC.grupoInvGrupo Integrado de Enxeñaría (GII)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.startPage221
dc.contributor.authorOrtega Morla, Javier
dc.contributor.authorFerreiro Villamor, Daniel
dc.contributor.authorPaz-López, Alejandro
dc.contributor.authorPérez-Sánchez, Beatriz
dc.contributor.authorGuerreiro-Santalla, Sara
dc.date.accessioned2026-04-21T07:30:09Z
dc.date.available2026-04-21T07:30:09Z
dc.date.issued2025
dc.description© 2025 IEEE. This version of the article has been accepted for publication, after peer review. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The Version of Record is available online at: https://doi.org/10.1109/ICETC66579.2025.11387358 Presented at: 2025 International Conference on Education Technology and Computers (ICETC), 18-21 September 2025, Barcelona, Spain
dc.description.abstract[Abstract]: This paper explores automatic evaluation and feedback generation for varied student output in programming exercises. To this end, a hybrid approach was implemented, combining Sentence Transformer Models for output classification with General Purpose Large Language Models for generating brief, informative feedback. The solution was tested on a dataset of 1,300 manually labeled student outputs from programming exams at the University of A Coruña. Finally, this system will be integrated into ProgTutor, an Intelligent Tutoring System, to replace the current evaluation method and enable its assessment in large-scale, real-world conditions.
dc.description.sponsorshipThe TED2021-131172B-I00 grant was funded by MCIN/AEI and the European Union NextGenerationEU/PRTR. Support was also received from the Galician Research Center (CITIC), funded by the Government of Galicia and the European Union (FEDER GALICIA 2014- 2020 program), through grant ED431G 2019/01, as well as from the Galician Government group (ED431C 2022/44 and ED431C 2021/39) with FEDER funds.
dc.description.sponsorshipXunta de Galicia; ED431G 2019/01
dc.description.sponsorshipXunta de Galicia; ED431C 2022/44
dc.description.sponsorshipXunta de Galicia; ED431C 2021/39
dc.identifier.citationJ. Ortega-Morla, D. Ferreiro, A. Paz-Lopez, B. Pérez-Sánchez and S. Guerreiro-Santalla, "Hybrid LLM and Sentence Transformer Evaluation for ITS Programming Exercises," 2025 International Conference on Education Technology and Computers (ICETC), Barcelona, Spain, 2025, pp. 221-225, doi: 10.1109/ICETC66579.2025.11387358.
dc.identifier.doi10.1109/ICETC66579.2025.11387358
dc.identifier.isbn979-8-3315-9791-7
dc.identifier.urihttps://hdl.handle.net/2183/48046
dc.language.isoeng
dc.publisherIEEE
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-131172B-I00/ES/TUTOR DE PROGRAMACION INTELIGENTE MEDIANTE ROBOTICA SIMULADA
dc.relation.urihttps://doi.org/10.1109/ICETC66579.2025.11387358
dc.rightsCopyright © 2025, IEEE
dc.rights.accessRightsopen access
dc.subjectIntelligent Tutoring Systems
dc.subjectITS
dc.subjectLarge Language Models
dc.subjectLLM
dc.subjectAutomatic Evaluation
dc.titleHybrid LLM and Sentence Transformer Evaluation for ITS Programming Exercises
dc.typeconference output
dspace.entity.typePublication
relation.isAuthorOfPublication5a2c7afc-804a-4032-b8ed-604ec7355150
relation.isAuthorOfPublication1729347a-a5bc-4ab0-a914-6c7a1dce7eb9
relation.isAuthorOfPublicatione3b7030b-b748-473b-9914-a447f81bf8d2
relation.isAuthorOfPublication.latestForDiscovery5a2c7afc-804a-4032-b8ed-604ec7355150

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PerezSanchez_Beatriz_2025_Hybrid_LLM_and_Sentence_Transformer_Evaluation.pdf
Size:
508.48 KB
Format:
Adobe Portable Document Format