Welcome to Journal of Graphics

Journal of Graphics ›› 2026, Vol. 47 ›› Issue (2): 368-379.DOI: 10.11996/JG.j.2095-302X.2026020368

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

3D model reconstruction based on retrieval and deformation techniques

PANG Min1,2, LI Zhentang1,2, ZHANG Yuan1,2, CUI Xiaokang1,2, XIONG Fengguang1,2()   

  1. 1 School of Computer Science and Technology, North University of China, Taiyuan Shanxi 030051, China
    2 Shanxi Key Laboratory of Machine Vision & Virtual Reality, Taiyuan Shanxi 030051, China
  • Received:2025-06-16 Accepted:2025-11-04 Online:2026-04-30 Published:2026-05-20
  • Contact: XIONG Fengguang
  • Supported by:
    National Natural Science Foundation of China(62272426);Shanxi Province Science and Technology Major Special Project(202201150401021);Youth Fund of Shanxi Province(202303021212189);Youth Fund of Shanxi Province(202303021212206)

Abstract:

As Virtual Reality (VR) and Augmented Reality (AR) technologies advance rapidly, the demand for high-quality 3D models has increased significantly. Traditional 3D modeling methods have drawbacks such as slow processing speed and poor adaptability to complex shapes. Consequently, a novel 3D model construction method based on 3D model retrieval and deformation was proposed. Firstly, a 3D model retrieval framework based on semantic keypoints was constructed, where sparse geometric feature points with semantic consistency were utilized to build a deformation-aware embedding space, enabling dynamic aggregation of global and local features. Meanwhile, Adaptive Global-CHANNEL Attention (AGCA) was embedded into a Transformer to form a joint attention mechanism, thereby enhancing the model’s expressiveness and retrieval accuracy. Then, for the retrieved models, a DGCNN-based keypoint-driven neural cage deformation algorithm was designed. The self-attention mechanism was utilized to calculate the influence weights of keypoints on vertices within local support regions. This process established a deformation mapping between feature keypoints and the neural cage structure, driving neural cage deformation to achieve fine-grained and constrained shape control. Finally, the loss function was improved by incorporating Chamfer distance and EMD distance constraints. This ensured that while focusing on local feature differences, geometric details were more accurately aligned, resulting in more precise 3D model reconstruction. Experiments were conducted on the Partnet and the Scan2CAD datasets to compare the proposed method with existing networks such as U-RED, ShapeFlow, and KP-RED. The results demonstrated that the proposed 3D model construction method could effectively handle noise and occlusion. The average value of the loss function was reduced by 33.33% and 41.67% on the Partnet dataset. moreover, on the Scan2CAD dataset, the average loss value was reduced by 3.6% compared with the baseline.

Key words: 3D model retrieval, deep learning, deformation of nerve cage, self attention mechanism, loss function

CLC Number: