Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2022, Vol. 43 ›› Issue (3): 469-477.DOI: 10.11996/JG.j.2095-302X.2022030469

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

ST-Rec3D: a structure and target-aware 3D reconstruction

  

  1. 1. School of Computer Science and Engineering, North Minzu University, Yinchuan Ningxia 750021, China;
    2. The Key Laboratory of Images & Graphics Intelligent Processing of State Ethnic Affairs Commission, Yinchuan Ningxia 750021, China
  • Online:2022-06-30 Published:2022-06-28
  • Supported by:
    National Natural Science Foundation of China (61762003, 62162001); “Light of the West” Talent Training and Introduction Plan of
    Chinese Academy of Sciences (JF2012c016-2); Ningxia Excellent Talents Support Program; Natural Science Foundation of Ningxia
    Province of China (2022AAC02041)

Abstract:

Image-based 3D reconstruction is the process of producing 3D representations of an object based on its single or multiple images. Existing methods for 3D reconstruction can directly learn to transform image features into 3D representations, using encoder-decoder structure, combined with binary cross entropy function and its deformation. However, the encoder cannot extract enough information from images to reconstruct high-quality 3D shapes, resulting in inaccurate Geometric details of reconstructed 3D objects. The loss functions based on the binary cross entropy function underperforms in target perception when the voxel distribution is imbalanced, leading to problems of incompleteness such as fractures and missing in the reconstruction results. To address these problems, a structure and target-aware 3D object reconstruction framework was proposed for single-view and multi-view 3D reconstruction, named ST-Rec3D. Combined with attention mechanism, we designed an encoder with a spatial perception structure, namely structure-aware encoder. In doing so, the spatial structure information could be fully captured in the input image and the local details of the reconstructed object could be effectively perceived. The utilization of IoU loss in the 3D voxel reconstruction, in the case of uneven voxel distribution, could accurately perceive the target object to ensure the integrity and accuracy of the reconstructed object. Experimental results demonstrate that ST-Rec3D can give a significant boost to reconstruction quality and outperform state-of-the-art methods on the ShapeNet and Pix3D.

Key words: 3D reconstruction, structure-aware, target-aware, attention mechanism, IoU loss

CLC Number: