
Journal of Graphics ›› 2024, Vol. 45 ›› Issue (3): 464-471. DOI: 10.11996/JG.j.2095-302X.2024030464


3D piece-wise planar reconstruction from a single indoor image based on self-augmented attention mechanism

ZHU Guanghui, MIAO Jun, HU Hongli, SHEN Ji, DU Ronghua

  1. School of Aeronautical Manufacturing Engineering, Nanchang Hangkong University, Nanchang Jiangxi 330063, China
  • Received: 2023-09-20 Accepted: 2024-03-04 Online: 2024-06-30 Published: 2024-06-06
  • Contact: MIAO Jun (1979-), associate professor, Ph.D. His main research interests cover image processing and pattern recognition, as well as 3D scene reconstruction. E-mail: miaojun@nchu.edu.cn
  • About author:

    ZHU Guanghui (1997-), master's student. His main research interest is computer image processing. E-mail: 2314045303@qq.com

  • Supported by:
    National Natural Science Foundation of China (62162045); National Natural Science Foundation of China (62366032)

Abstract:

Piece-wise planar 3D reconstruction of indoor scenes using convolutional neural networks (CNNs) has become a hot topic in indoor scene modeling research. However, because planar and non-planar elements are intertwined in such scenes, the network often extracts non-planar information mixed with planar features, which degrades the final segmentation accuracy. Moreover, the planes in indoor scenes differ significantly in scale, leading to pronounced class imbalance in which small-scale plane instances are prone to distortion. To address these challenges, this paper proposes a self-enhanced attention-based multi-scale feature fusion network for 3D plane segmentation and reconstruction. The network automatically learns planar features in the scene and effectively fuses feature information from different scales, thereby improving the accuracy of plane instance segmentation. In addition, by assigning different weights to each pixel in a plane instance, and in particular increasing the weights of edge pixels of small-scale planes, the channel representation of small-scale plane segmentation targets is further enhanced. Finally, a new loss function combining balanced cross-entropy loss and dice loss is constructed to train the model, further improving plane segmentation accuracy. Extensive experiments demonstrate that the proposed algorithm achieves significant improvements in plane recall and segmentation accuracy, yielding more accurate piece-wise planar 3D reconstruction models of indoor scenes.
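
The abstract does not give the exact form of the loss, so the following is only a minimal sketch of one common way to combine a class-balanced cross-entropy term with a dice term for a binary planar/non-planar mask. The function names, the inverse-frequency class weights, the smoothing constant, and the mixing weight `lam` are all assumptions made for illustration and are not taken from the paper.

```python
import math

def balanced_bce(probs, labels, eps=1e-7):
    """Class-balanced binary cross-entropy over flat lists of pixels.

    Assumption: each class is weighted by the frequency of the other
    class, so the rarer class contributes more per pixel.
    """
    n = len(labels)
    n_pos = sum(labels)
    w_pos = (n - n_pos) / n  # weight for planar pixels (label 1)
    w_neg = n_pos / n        # weight for non-planar pixels (label 0)
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        if y == 1:
            total -= w_pos * math.log(p)
        else:
            total -= w_neg * math.log(1.0 - p)
    return total / n

def dice_loss(probs, labels, smooth=1.0):
    """Soft dice loss: 1 minus the smoothed dice overlap coefficient."""
    inter = sum(p * y for p, y in zip(probs, labels))
    denom = sum(probs) + sum(labels)
    return 1.0 - (2.0 * inter + smooth) / (denom + smooth)

def combined_loss(probs, labels, lam=1.0):
    """Hypothetical combination: balanced BCE plus lam times dice loss."""
    return balanced_bce(probs, labels) + lam * dice_loss(probs, labels)

# Toy mask with one planar pixel out of four (an imbalanced case).
labels = [1, 0, 0, 0]
good = combined_loss([0.9, 0.1, 0.1, 0.1], labels)
bad = combined_loss([0.1, 0.9, 0.9, 0.9], labels)
```

In this sketch the cross-entropy term scores each pixel individually, while the dice term scores the overlap of the whole predicted region with the ground truth, which is one reason the two are often paired when small instances are under-represented.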

Key words: deep learning, segmented plane reconstruction, multi-scale fusion, enhanced attention, self-attention
