Sustainable Techniques to Improve Data Quality for Training Image-Based Explanatory Models for Recommender Systems

Paz Ruza, Jorge; Esteban Martínez, David; Alonso-Betanzos, Amparo; Guijarro-Berdiñas, Bertha

Sustainable Techniques to Improve Data Quality for Training Image-Based Explanatory Models for Recommender Systems

UDC.coleccion	Investigación
UDC.conferenceTitle	34th International Conference on Artificial Neural Networks (ICANN 2025)
UDC.departamento	Ciencias da Computación e Tecnoloxías da Información
UDC.grupoInv	Laboratorio de Investigación e Desenvolvemento en Intelixencia Artificial (LIDIA)
UDC.institutoCentro	CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.volume	16070
dc.contributor.author	Paz Ruza, Jorge
dc.contributor.author	Esteban Martínez, David
dc.contributor.author	Alonso-Betanzos, Amparo
dc.contributor.author	Guijarro-Berdiñas, Bertha
dc.date.accessioned	2025-11-11T09:45:40Z
dc.date.available	2025-11-11T09:45:40Z
dc.date.issued	2025-09-10
dc.description.abstract	[Abstract]: Visual explanations based on user-uploaded images are an effective and self-contained approach to provide transparency to Recommender Systems (RS), but intrinsic limitations of data used in this explainability paradigm cause existing approaches to use bad quality training data that is highly sparse and suffers from labelling noise. Popular training enrichment approaches like model enlargement or massive data gathering are expensive and environmentally unsustainable, thus we seek to provide better visual explanations to RS aligning with the principles of Responsible AI. In this work, we research the intersection of effective and sustainable training enrichment strategies for visual-based RS explainability models by developing three novel strategies that focus on training Data Quality: 1) selection of reliable negative training examples using Positive-unlabelled Learning, 2) transform-based data augmentation, and 3) text-to-image generative-based data augmentation. The integration of these strategies in three state-of-the-art explainability models increases (5%) the performance in relevant ranking metrics of these visual-based RS explainability models without penalizing their practical long-term sustainability, as tested in multiple real-world restaurant recommendation explanation datasets
dc.description.sponsorship	This work is funded by MICIU/AEI and ESF+ (FPU21/05783), ERDF A way of making Europe (PID2019-109238GB-C22, PID2023-147404OB-I00), ERDF/EU (PID2021-128045OA-I00), Ministry for Digital Transformation and Civil Service and ‘Next-GenerationEU’/PRTR (TSI-100925-2023-1), and Xunta de Galicia (ED431C 2022/44). CITIC is funded by “Consellería de Cultura, Educación e Universidade” through ERDF and “Secretaría Xeral de Universidades" (ED431G 2023/01).
dc.description.sponsorship	Xunta de Galicia; ED431C 2022/44
dc.description.sponsorship	Xunta de Galicia; ED431G 2023/01
dc.identifier.citation	Paz-Ruza, J., Esteban-Martínez, D., Alonso-Betanzos, A., Guijarro-Berdiñas, B. (2026). Sustainable Techniques to Improve Data Quality for Training Image-Based Explanatory Models for Recommender Systems. In: Senn, W., et al. Artificial Neural Networks and Machine Learning – ICANN 2025. ICANN 2025. Lecture Notes in Computer Science, vol 16070. Springer, Cham. https://doi.org/10.1007/978-3-032-04549-2_19
dc.identifier.isbn	978-3-032-04548-5
dc.identifier.isbn	978-3-032-04549-2
dc.identifier.issn	1611-3349
dc.identifier.issn	0302-9743
dc.identifier.uri	https://hdl.handle.net/2183/46391
dc.language.iso	eng
dc.publisher	Springer Nature
dc.relation.projectID	info:eu-repo/grantAgreement/MECD/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/FPU21%2F05783/ES/
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-109238GB-C22/ES/APRENDIZAJE AUTOMATICO ESCALABLE Y EXPLICABLE/
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147404OB-I00/ES/APRENDIZAJE AUTOMATICO FRUGAL: POTENCIANDO LA IA EN ENTORNOS CON RECURSOS LIMITADOS PARA LOS DESAFIOS DEL MUNDO REAL
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-128045OA-I00/ES/APRENDIZAJE PROFUNDO ETICO
dc.relation.projectID	info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES
dc.relation.uri	https://doi.org/10.1007/978-3-032-04549-2_19
dc.rights	© 2026 The Author(s), under exclusive license to Springer Nature Switzerland AG
dc.rights.accessRights	embargoed access
dc.subject	Machine Learning
dc.subject	eXplainable AI
dc.subject	Frugal AI
dc.subject	Recommender Systems
dc.subject	Positive-Unlabelled Learning
dc.subject	Data Quality
dc.title	Sustainable Techniques to Improve Data Quality for Training Image-Based Explanatory Models for Recommender Systems
dc.type	conference output
dspace.entity.type	Publication
relation.isAuthorOfPublication	c91f7d18-38fb-42b8-8be2-b402a40b10c5
relation.isAuthorOfPublication	a89f1cad-dbc5-471f-986a-26c021ed4a95
relation.isAuthorOfPublication	d839396d-454e-4ccd-9322-d3e89a876865
relation.isAuthorOfPublication.latestForDiscovery	c91f7d18-38fb-42b8-8be2-b402a40b10c5

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PazRuza_Jorge_2025_Sustainable_tec_data_quality_recomm_sys.pdf
Size:: 4.09 MB
Format:: Adobe Portable Document Format

(2026-09-10) Download

Collections

Investigación (FIC)