Post-hoc categorization based on explainable AI and reinforcement learning for improved intrusion detection

Larriva Novo, Xavier Andrés ORCID: https://orcid.org/0000-0001-5335-5698, Pérez Miguel, Luis ORCID: https://orcid.org/0000-0002-8216-8039, Villagra González, Víctor Abraham ORCID: https://orcid.org/0000-0002-7067-6968, Álvarez-Campana Fernández-Corredor, Manuel ORCID: https://orcid.org/0000-0003-2747-9798, Sánchez Zas, Carmen ORCID: https://orcid.org/0000-0003-0791-6946 and Jover Walsh, Óscar ORCID: https://orcid.org/0009-0006-2026-147X (2024). Post-hoc categorization based on explainable AI and reinforcement learning for improved intrusion detection. "Applied Sciences", v. 14 (n. 24); p. 11511. ISSN 2076-3417. https://doi.org/10.3390/app142411511.

Descripción

Título: Post-hoc categorization based on explainable AI and reinforcement learning for improved intrusion detection
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Applied Sciences
Fecha: 1 Diciembre 2024
ISSN: 2076-3417
Volumen: 14
Número: 24
Materias:
ODS:
Palabras Clave Informales: Cybersecurity; IDS; reinforcement learning; SHAP; UNSW-NB15; XAI
Escuela: E.T.S.I. Telecomunicación (UPM)
Departamento: Ingeniería de Sistemas Telemáticos
Licencias Creative Commons: Reconocimiento

Texto completo

[thumbnail of 10316840.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

The massive usage of Internet services nowadays has led to a drastic increase in cyberattacks, including sophisticated techniques, so that Intrusion Detection Systems (IDSs) need to use AP technologies to enhance their effectiveness. However, this has resulted in a lack of interpretability and explainability from different applications that use AI predictions, making it hard to understand by cybersecurity operators why decisions were made. To address this, the concept of Explainable AI (XAI) has been introduced to make the AI's decisions more understandable at both global and local levels. This not only boosts confidence in the AI but also aids in identifying different attributes commonly used in cyberattacks for the exploitation of flaws or vulnerabilities. This study proposes two developments: first, the creation and evaluation of machine learning models for an IDS with the objective to use Reinforcement Learning (RL) to classify malicious network traffic, and second, the development of a methodology to extract multi-level explanations from the RL model to identify, detect, and understand how different attributes affect uncertain types of attack categories.

Más información

ID de Registro: 89074
Identificador DC: https://oa.upm.es/89074/
Identificador OAI: oai:oa.upm.es:89074
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/10316840
Identificador DOI: 10.3390/app142411511
URL Oficial: https://www.mdpi.com/2076-3417/14/24/11511
Depositado por: iMarina Portal Científico
Depositado el: 14 May 2025 10:08
Ultima Modificación: 30 Ene 2026 09:12