欢迎访问《图学学报》

图学学报 ›› 2026, Vol. 47 ›› Issue (2): 368-379.DOI: 10.11996/JG.j.2095-302X.2026020368

• 计算机图形学与虚拟现实 • 上一篇    下一篇

基于检索与变形技术的三维模型重构

庞敏1,2, 李振堂1,2, 张元1,2, 崔晓康1,2, 熊风光1,2()   

  1. 1 中北大学计算机科学与技术学院山西 太原 030051
    2 机器视觉与虚拟现实山西省重点实验室山西 太原 030051
  • 收稿日期:2025-06-16 接受日期:2025-11-04 出版日期:2026-04-30 发布日期:2026-05-20
  • 通讯作者:熊风光,E-mail:hopenxfg@nuc.edu.cn
  • 基金资助:
    国家自然科学基金(62272426);山西省科技重大专项计划(202201150401021);山西省青年基金(202303021212189);山西省青年基金(202303021212206)

3D model reconstruction based on retrieval and deformation techniques

PANG Min1,2, LI Zhentang1,2, ZHANG Yuan1,2, CUI Xiaokang1,2, XIONG Fengguang1,2()   

  1. 1 School of Computer Science and Technology, North University of China, Taiyuan Shanxi 030051, China
    2 Shanxi Key Laboratory of Machine Vision & Virtual Reality, Taiyuan Shanxi 030051, China
  • Received:2025-06-16 Accepted:2025-11-04 Published:2026-04-30 Online:2026-05-20
  • Contact: XIONG Fengguang,E-mail:hopenxfg@nuc.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62272426);Shanxi Province Science and Technology Major Special Project(202201150401021);Youth Fund of Shanxi Province(202303021212189);Youth Fund of Shanxi Province(202303021212206)

摘要:

随着虚拟现实(VR)、增强现实(AR)技术的快速发展,对高质量三维模型的需求日益增加。传统建模方法存在处理速度慢、复杂形状适应性差等问题。因此,提出一种基于检索与变形的三维模型构建方法。首先构建一个基于语义关键点的三维模型检索框架,以模型具有的稀疏关键点为基础构建变形感知嵌入空间,实现全局特征与局部特征的动态聚合,同时将自适应全局通道注意力(AGCA)嵌入Transformer构建联合注意力机制,以提升模型的表达能力和检索精度;然后针对检索结果模型设计一套基于DGCNN关键点驱动神经笼变形算法,结合自注意力机制计算关键点对局部支撑区域内顶点的影响权重,根据特征关键点与神经笼结构之间的变形映射,驱动神经笼变形,实现精细且受约束的形变控制;最后结合倒角距离和EMD距离约束,改进损失函数,在关注局部特征差异同时,更准确地对齐几何细节,实现更精确的三维模型重建。实验验证在开源数据集Partnet和Scan2CAD上进行,并和U-RED,ShapeFlow和KP-RED等网络进行效果对比。实验结果表明本文提出的三维模型构建方法能够有效应对噪声与遮挡问题,其中Partnet数据集上损失函数的平均值分别降低33.33%和41.67%;在Scan2CAD上,损失函数的平均值基于baseline降低了3.6%。

关键词: 三维模型检索, 深度学习, 神经笼变形, 自注意力机制, 损失函数

Abstract:

As Virtual Reality (VR) and Augmented Reality (AR) technologies advance rapidly, the demand for high-quality 3D models has increased significantly. Traditional 3D modeling methods have drawbacks such as slow processing speed and poor adaptability to complex shapes. Consequently, a novel 3D model construction method based on 3D model retrieval and deformation was proposed. Firstly, a 3D model retrieval framework based on semantic keypoints was constructed, where sparse geometric feature points with semantic consistency were utilized to build a deformation-aware embedding space, enabling dynamic aggregation of global and local features. Meanwhile, Adaptive Global-CHANNEL Attention (AGCA) was embedded into a Transformer to form a joint attention mechanism, thereby enhancing the model’s expressiveness and retrieval accuracy. Then, for the retrieved models, a DGCNN-based keypoint-driven neural cage deformation algorithm was designed. The self-attention mechanism was utilized to calculate the influence weights of keypoints on vertices within local support regions. This process established a deformation mapping between feature keypoints and the neural cage structure, driving neural cage deformation to achieve fine-grained and constrained shape control. Finally, the loss function was improved by incorporating Chamfer distance and EMD distance constraints. This ensured that while focusing on local feature differences, geometric details were more accurately aligned, resulting in more precise 3D model reconstruction. Experiments were conducted on the Partnet and the Scan2CAD datasets to compare the proposed method with existing networks such as U-RED, ShapeFlow, and KP-RED. The results demonstrated that the proposed 3D model construction method could effectively handle noise and occlusion. The average value of the loss function was reduced by 33.33% and 41.67% on the Partnet dataset. moreover, on the Scan2CAD dataset, the average loss value was reduced by 3.6% compared with the baseline.

Key words: 3D model retrieval, deep learning, deformation of nerve cage, self attention mechanism, loss function

中图分类号: