🌙

对话中多模态情感识别的层次不确定性建模

Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation

IEEE Transactions on Cybernetics · 2022
被引 36
ABS 3

中文导读

提出HU-Dialogue模型,通过源自适应噪声扰动上下文注意力权重来建模上下文级不确定性,并采用贝叶斯深度学习方法建模模态级不确定性,在三个多模态情感识别数据集上超越现有方法。

Abstract

Approximating the uncertainty of an emotional AI agent is crucial for improving the reliability of such agents and facilitating human-in-the-loop solutions, especially in critical scenarios. However, none of the existing systems for emotion recognition in conversation (ERC) has attempted to estimate the uncertainty of their predictions. In this article, we present HU-Dialogue, which models hierarchical uncertainty for the ERC task. We perturb contextual attention weight values with source-adaptive noises within each modality, as a regularization scheme to model context-level uncertainty and adapt the Bayesian deep learning method to the capsule-based prediction layer to model modality-level uncertainty. Furthermore, a weight-sharing triplet structure with conditional layer normalization is introduced to detect both invariance and equivariance among modalities for ERC. We provide a detailed empirical analysis for extensive experiments, which shows that our model outperforms previous state-of-the-art methods on three popular multimodal ERC datasets.

对话系统多模态情感识别不确定性量化贝叶斯深度学习