RBDF:用于可见光红外行人重识别的互逆双向框架

RBDF: Reciprocal Bidirectional Framework for Visible Infrared Person Reidentification

IEEE Transactions on Cybernetics · 2022
被引 32
ABS 3

中文导读

提出互逆双向框架,通过双向图像翻译子网络学习可见光与红外模态间的映射,生成高相似度异构图像,消除模态差异,在SYSU-MM01数据集上达到54.41% mAP和57.66% rank-1准确率,超越现有方法。

Abstract

Visible infrared person reidentification (VI-REID) plays a critical role in night-time surveillance applications. Most methods attempt to reduce the cross-modality gap by extracting the modality-shared features. However, they neglect the distinct image-level discrepancies among heterogeneous pedestrian images. In this article, we propose a reciprocal bidirectional framework (RBDF) to achieve modality unification before discriminative feature learning. The bidirectional image translation subnetworks can learn two opposite mappings between visible and infrared modality. Particularly, we investigate the characteristics of the latent space and design a novel associated loss to pull close the distribution between the intermediate representations of two mappings. Mutual interaction between two opposite mappings helps the network generate heterogeneous images that have high similarity with the real images. Hence, the concatenation of original and generated images can eliminate the modality gap. During the feature learning procedure, the attention mechanism-based feature embedding network can learn more discriminative representations with the identity classification and feature metric learning. Experimental results indicate that our method achieves state-of-the-art performance. For instance, we achieve 54.41% mAP and 57.66% rank-1 accuracy on SYSU-MM01 dataset, outperforming the existing works by a large margin.

行人重识别可见光红外跨模态图像翻译特征学习监控系统