Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (4): 775-783.DOI: 10.11996/JG.j.2095-302X.2023040775

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

Generation and selection of synthetic data for cross-domain person re-identification

CAI Yi-wu1(), ZHANG Yu-jia1, ZHANG Yong-fei1,2()   

  1. 1. Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing 100191, China
    2. State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing 100191, China
  • Received:2022-11-30 Accepted:2022-12-26 Online:2023-08-31 Published:2023-08-16
  • Contact: ZHANG Yong-fei (1982-), professor, Ph.D. His main research interests cover computer vision, etc. E-mail:yfzhang@buaa.edu.cn
  • About author:

    CAI Yi-wu (1999-), master student. His main research interest covers person re-identification. E-mail:caiyiwu@buaa.edu.cn

  • Supported by:
    National Natural Science Foundation of China(62072022)

Abstract:

The reliance of mainstream deep learning-based person re-identification models on large-scale labeled data for training is a costly process that requires extensive collection and labeling efforts. Additionally, the existing virtual data generation methods neglect to account for the characteristics of target domain, thereby compromising the performance of cross-domain re-identification. To address these issues, this paper proposed a synthetic data generation and selection algorithm for cross-domain person re-identification. First, this algorithm utilized the foreground information of the target domain, including the color distribution of individuals’ clothing, to guide the generation of virtual 3D human models. The background information of the target domain was employed to replace the background of source domain data. This served to enhance the data quality at the pixel level, while also guiding the model to distinguish different persons based on the foreground. Finally, the proposed method employed distribution metrics such as Wasserstein Distance to measure the feature distribution distance between the source domain and target domain. This distance was used to select the source domain training subset closest to the target domain for model training. The experimental results demonstrated the superiority of this method over other existing person virtual data generation algorithms, as it can significantly improve the cross-domain generalization performance of the person re-identification model.

Key words: person re-identification, data generation, virtual engine, data selection, distribution measure

CLC Number: