Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (2): 393-401.DOI: 10.11996/JG.j.2095-302X.2025020393

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

3D human pose and shape estimation from single-view point clouds with semi-supervised learning

FANG Chenghao(), WANG Kangkan()   

  1. Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, Nanjing University of Science and Technology, Nanjing Jiangsu 210094, China
  • Received:2024-07-05 Accepted:2024-11-27 Online:2025-04-30 Published:2025-04-24
  • Contact: WANG Kangkan
  • About author:First author contact:

    FANG Chenghao (1999-), master student. His main research interests cover computer graphics, computer vision and 3D reconstruction. E-mail:121106022661@njust.edu.cn

  • Supported by:
    The Natural Science Foundation of China(62472224);The Fundamental Research Funds for the Central Universities(NJ2023032);The Open Project Program of the State Key Laboratory of CAD&CG of Zhejiang University(A2311);The Open Project Program of the State Key Laboratory of Novel Software Technology of Nanjing University(KFKT2024B37)

Abstract:

Under the condition of limited labeled samples, estimating 3D human pose and shape from single-view point clouds has consistently encountered issues such as low model estimation accuracy and weak generalization capability. Existing methods typically use a fine-tuning step to optimize the models for limited labeled samples, but this fine-tuning process significantly increases computational complexity and without fundamentally enhancing model generalization. To address these issues, a semi-supervised learning-based method was proposed for 3D human pose and shape estimation. Under conditions of limited labeled data, the proposed method utilized a large amount of unlabeled human point clouds to improve model accuracy and generalization capability. Specifically, weak and strong augmentations were applied to the unlabeled data, and 3D human parameter models were estimated for both types of augmented samples. Then, the accuracy of pseudo-labels for weakly-augmented samples was evaluated, and the predictions of strongly augmented samples were constrained based on consistency regularization. The procedure above was applied iteratively to gradually refine the quality of pseudo-labels and increase the number of pseudo-labels for training, thereby enhancing the model’s estimation accuracy. Extensive quantitative and qualitative experiments on various public datasets demonstrate that the proposed method enhanced the accuracy of 3D human pose and shape estimation under conditions of limited labeled samples and enhanced model generalization performance.

Key words: 3D human pose and shape estimation, single-view point clouds, semi-supervised learning, pseudo-label, data augmentation of point cloud

CLC Number: