欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (1): 33-40.DOI: 10.11996/JG.j.2095-302X.2023010033

• 图像处理与计算机视觉 • 上一篇    下一篇

基于残差增强注意力的跨模态行人重识别

邵文斌(), 刘玉杰(), 孙晓瑞, 李宗民   

  1. 中国石油大学(华东)计算机科学与技术学院,山东 青岛 266580
  • 收稿日期:2022-04-26 修回日期:2022-06-13 出版日期:2023-10-31 发布日期:2023-02-16
  • 通讯作者: 刘玉杰
  • 作者简介:邵文斌(1998-),男,硕士研究生。主要研究方向为行人重识别、目标检测。E-mail:wbShao@s.upc.edu.cn
  • 基金资助:
    国家重点研发计划项目(2019YFF0301800);国家自然科学基金项目(61379106);山东省自然科学基金项目(ZR2013FM036);山东省自然科学基金项目(ZR2015FM011)

Cross modality person re-identification based on residual enhanced attention

SHAO Wen-bin(), LIU Yu-jie(), SUN Xiao-rui, LI Zong-min   

  1. College of Computer Science and Technology, China University of Petroleum (East China), Qingdao Shandong 266580, China
  • Received:2022-04-26 Revised:2022-06-13 Online:2023-10-31 Published:2023-02-16
  • Contact: LIU Yu-jie
  • About author:SHAO Wen-bin (1998-), master student. His main research interests cover person re-identification, object detection. E-mail:wbShao@s.upc.edu.cn
  • Supported by:
    National Key Research and Development Program(2019YFF0301800);National Natural Science Foundation of China(61379106);Shandong Provincial Natural Science Foundation(ZR2013FM036);Shandong Provincial Natural Science Foundation(ZR2015FM011)

摘要:

跨模态行人重识别主要面临2个问题:①成像机制不同所导致的红外图像和可见光图像之间的模态差异;②图像特征的身份判别性不足导致的类内差异。针对上述2个问题,基于残差增强注意力的跨模态行人重识别方法被提出用来提高行人特征的模态不变性和身份判别性。首先,设计网络浅层参数独立、网络深层参数共享的双路卷积神经网络作为骨干网络。然后,分析现有注意力机制存在的全局弱化,设计了残差增强注意力方法解决该问题,提升注意力机制的性能,将其分别应用在网络浅层的通道维度和深层的空间位置上,提升模型对于模态差异的消除能力和行人特征的身份鉴别能力。在SYSU-MM01和RegDB 2个数据集上进行的实验证明了该方法的先进性,大量的对比实验也充分证明本文方法的有效性。

关键词: 行人重识别, 残差增强, 注意力机制, 模态不变性, 神经网络

Abstract:

Cross modality person re-identification mainly faces two problems: ① Modality discrepancies between infrared and visible images caused by different imaging mechanisms. ② Intra-class discrepancies caused by the insufficient identity discrimination of features. In order to address the above two problems, a cross modality person re-identification method based on residual enhanced attention was proposed to improve the modality invariance and identity discrimination of pedestrian features. First, with non-shared parameters at the shallow network and shared parameters at the deep layer a dual-stream convolutional neural network was designed as the backbone. Then, the problem of global weakening in the existing attention mechanism was analyzed, and a residual enhancement method was designed to solve this problem and improve the performance of the attention mechanism. It was applied to the shallow channel dimension and deep spatial location of the network respectively. Sufficient experiments on the two datasets SYSU-MM01 and RegDB have proved the effectiveness of the method.

Key words: person re-identification, residual enhancement, attention mechanism, modality invariance, neural network

中图分类号: