欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (2): 280-290.DOI: 10.11996/JG.j.2095-302X.2023020280

• 图像处理与计算机视觉 • 上一篇    下一篇

基于多分支注意网络与相似度学习策略的无监督行人重识别

冯尊登(), 王洪元(), 林龙, 孙博言, 陈海琴   

  1. 常州大学计算机与人工智能学院,江苏 常州 213164
  • 收稿日期:2022-07-27 接受日期:2022-11-15 出版日期:2023-04-30 发布日期:2023-05-01
  • 通讯作者: 王洪元(1960-),男,教授,博士。主要研究方向为图像处理、人工智能、模式识别等。E-mail:hywang@cczu.edu.cn
  • 作者简介:冯尊登(1997-),男,硕士研究生。主要研究方向为计算机视觉、行人重识别。E-mail:1106351887@qq.com
  • 基金资助:
    国家自然科学基金项目(61976028);江苏省研究生科研与实践创新计划项目(KYCX22_3067)

Unsupervised person re-identification with multi-branch attention network and similarity learning strategy

FENG Zun-deng(), WANG Hong-yuan(), LIN Long, SUN Bo-yan, CHEN Hai-qin   

  1. School of Computer Science & Artificial Intelligence, Changzhou Jiangsu 213164, China
  • Received:2022-07-27 Accepted:2022-11-15 Online:2023-04-30 Published:2023-05-01
  • Contact: WANG Hong-yuan (1960-), professor, Ph.D. His main research interests cover image processing, computer vision, pattern recognition, etc. E-mail:hywang@cczu.edu.cn
  • About author:FENG Zun-deng (1997-), master student. His main research interests cover computer vision and person re-identification. E-mail:1106351887@qq.com
  • Supported by:
    National Natural Science Foundation of China(61976028);Postgraduate Research & Practice Innovation Program of Jiangsu Province(KYCX22_3067)

摘要:

无监督行人重识别的挑战在于学习没有真实标签的行人的判别性特征。为增强网络对行人特征的表达能力,进一步从空间和通道维度上提取更丰富的特征信息,提出了一种基于多分支注意网络的行人重识别特征提取方法。该方法通过捕获空间维度和通道维度上不同分支之间的交互信息,能够学习到更具判别性的行人特征表示。此外,针对噪声标签会对聚类质心产生干扰的问题,提出了相似度学习策略(SLS)。该策略先计算每个聚类中样本特征之间的相似性,然后选取相似性分数最高的特征向量所对应的样本进行对比学习,有效地缓解了聚类噪声导致的累积训练误差。实验结果表明,和无监督场景下的自步对比学习方法(SPCL)相比,在Market-1501,DukeMTMC-reID和MSMT17等3个数据集上的rank-1准确度分别提升了4.6%,3.3%和16.3%,显著地提高了无监督行人重识别的检索精度。

关键词: 无监督行人重识别, 多分支注意网络, 聚类质心, 相似度学习策略, 对比学习

Abstract:

The challenge facing the unsupervised person re-identification (Re-ID) lies in learning discriminative features without true labels. To address this, a person re-identification feature extraction method based on multi-branch attention network was proposed, in order to enhance the ability of the network to express pedestrian features and capture more abundant feature information from spatial and channel dimensions. This method could learn a more discriminative representation of pedestrian features by capturing the interaction information between different branches on the spatial dimension and the channel dimension. In addition, to tackle the issue of noisy labels interfering with cluster centroids, a similarity learning strategy (SLS) was proposed. This strategy first calculated the similarity between the sample features in each cluster, and then selected the samples corresponding to the feature vector with the highest similarity score for contrastive learning, thereby effectively mitigating the cumulative training error caused by noisy labels. The experimental results revealed that compared with the self-paced contrastive learning (SPCL) method in the unsupervised scenarios, the rank-1 precision on the three datasets Market1501, DukeMTMC-reID, and MSMT17 was increased by 4.6%, 3.3%, and 16.3%, respectively, significantly enhancing the retrieval accuracy of unsupervised person re-identification.

Key words: unsupervised person re-identification, multi-branch attention network, cluster centroid, similarity learning strategy, contrastive learning

中图分类号: