迈向以人为中心的可解释性:异常检测的自然语言解释

Toward Human-Centered Explainability: Natural Language Explanations for Anomaly Detection

Information Systems Frontiers · 2026
被引 0
ABS 3

中文导读

提出一种以人为中心的异常检测可解释AI流程,利用本地大语言模型将模型输出和特征归因转化为自然语言解释,并通过人机交互提升解释质量,适用于网络安全分析师。

Abstract

Abstract This paper proposes a human-centered explainable artificial intelligence pipeline for anomaly detection, designed to generate meaningful, context-aware explanations using local large language models. The proposed pipeline translates model outputs and SHAP-based feature attributions into natural language explanations for cybersecurity alerts generated by an autoencoder within an enterprise network. It incorporates a human-in-the-loop component to ground the explanations in validated expert knowledge, enhancing their interpretability and alignment with human decision-making processes. Using a rubric-driven LLM-as-a-Judge evaluation, we benchmark several large language models and show that as smaller models receive more contextual grounding through human-in-the-loop, their explanatory performance improves significantly, narrowing the gap with larger models while maintaining substantially lower computational demands. Our approach provides targeted, context-aware explanations designed to meet the cognitive and operational needs of security analysts, contributing to more ethical, trustworthy, and resource-efficient AI integration in critical cybersecurity environments.

异常检测可解释人工智能网络安全自然语言处理