Toward Human-Centered Explainability: Natural Language Explanations for Anomaly Detection

UDC.coleccion: Investigación
UDC.departamento: Ciencias da Computación e Tecnoloxías da Información
UDC.grupoInv: Telemática
UDC.institutoCentro: CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitle: Information Systems Frontiers
dc.contributor.author: Padín Torrente, Héctor
dc.contributor.author: Carneiro, Víctor
dc.contributor.author: Ortega-Fernández, Inés
dc.date.accessioned: 2026-04-17T08:58:33Z
dc.date.available: 2026-04-17T08:58:33Z
dc.date.issued: 2026-04-10
dc.description: The benchmark dataset analyzed in this study is CSE-CIC-IDS2018, maintained by the Canadian Institute for Cybersecurity, University of New Brunswick, and openly accessible at https://www.unb.ca/cic/datasets/ids-2018.html. The evaluation data used to assess the models and the various data generation strategies consist of the predictions produced by the autoencoder, combined with SHAP outputs, applied to the attack instances within the dataset. This evaluation data, along with prompt details, is available on GitHub (https://github.com/Gradiant/Natural-Language-Explanations-4-AD).
dc.description.abstract: [Abstract]: This paper proposes a human-centered explainable artificial intelligence pipeline for anomaly detection, designed to generate meaningful, context-aware explanations using local large language models. The proposed pipeline translates model outputs and SHAP-based feature attributions into natural language explanations for cybersecurity alerts generated by an autoencoder within an enterprise network. It incorporates a human-in-the-loop component to ground the explanations in validated expert knowledge, enhancing their interpretability and alignment with human decision-making processes. Using a rubric-driven LLM-as-a-Judge evaluation, we benchmark several large language models and show that as smaller models receive more contextual grounding through human-in-the-loop, their explanatory performance improves significantly, narrowing the gap with larger models while maintaining substantially lower computational demands. Our approach provides targeted, context-aware explanations designed to meet the cognitive and operational needs of security analysts, contributing to more ethical, trustworthy, and resource-efficient AI integration in critical cybersecurity environments.
dc.description.sponsorship: Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This publication is part of the project SafeNet UEBA (CPP001/23-R006_SafeNet_UEBA), financed by "European Union NextGeneration-EU, https://next-generation-eu.europa.eu/, the Recovery Plan, Transformation and Resilience, https://planderecuperacion.gob.es/, through INCIBE, https://www.incibe.es/". This work was carried out at CITIC, within the framework of the project PID2023-150794OB-I00, funded by the Ministry of Science, Innovation and Universities (MICIU) and the State Research Agency (AEI)/10.13039/501100011033, and co-funded by the European Regional Development Fund (ERDF), European Union. CITIC, accredited as a center of excellence within the Galician University System and a member of the CIGUS Network, also receives support from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia, co-financed by the EU through the ERDF Galicia 2021–2027 programme (Ref. ED431G 2023/01).
dc.description.sponsorship: Xunta de Galicia; ED431G 2023/01
dc.description.sponsorship: Instituto Nacional de Ciberseguridad (INCIBE); CPP001/23-R006_SafeNet_UEBA
dc.identifier.citation: Padín-Torrente, H., Carneiro-Diaz, V. & Ortega-Fernandez, I. Toward Human-Centered Explainability: Natural Language Explanations for Anomaly Detection. Inf Syst Front (2026). https://doi.org/10.1007/s10796-026-10717-3
dc.identifier.doi: 10.1007/s10796-026-10717-3
dc.identifier.issn: 1572-9419
dc.identifier.uri: https://hdl.handle.net/2183/48029
dc.language.iso: eng
dc.publisher: Springer
dc.relation.isbasedon: https://www.unb.ca/cic/datasets/ids-2018.html
dc.relation.isbasedon: https://github.com/Gradiant/Natural-Language-Explanations-4-AD
dc.relation.projectID: info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-150794OB-I00/ES/MEJORANDO LA DETECCION DE CIBER AMENAZAS USANDO MODELOS DE LENGUAJE DE GRAN TAMAÑO PARA PROTOCOLOS DE RED
dc.relation.uri: https://doi.org/10.1007/s10796-026-10717-3
dc.rights: Attribution 4.0 International
dc.rights.accessRights: open access
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Human-centered explainable AI
dc.subject: Natural language explanations
dc.subject: Human-in-the-loop
dc.subject: LLM-as-a-Judge
dc.subject: Anomaly detection
dc.title: Toward Human-Centered Explainability: Natural Language Explanations for Anomaly Detection
dc.type: journal article
dc.type.hasVersion: VoR
dspace.entity.type: Publication
relation.isAuthorOfPublication: 652c136c-eea5-4a78-947c-538b1c99f81b
relation.isAuthorOfPublication.latestForDiscovery: 652c136c-eea5-4a78-947c-538b1c99f81b

Files

Original bundle

Name: CarneiroDiaz_Victor_2026_Toward_Human_Centered_Explainability.pdf
Size: 1.94 MB
Format: Adobe Portable Document Format