Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (1): 230-239.DOI: 10.11996/JG.j.2095-302X.2024010230

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

Dense point cloud reconstruction network based on adaptive aggregation recurrent recursion

WANG Jiang’an(), HUANG Le, PANG Dawei, QIN Linzhen, LIANG Wenqian   

  1. School of Information Engineering, Chang’an University, Xi’an, Shaanxi 710064, China
  • Received:2023-06-19 Accepted:2023-12-04 Online:2024-02-29 Published:2024-02-29
  • About author:

    WANG Jiangan (1981-), associate professor, Ph.D. His main research interests cover computer vision and 3D modeling. E-mail:wangjiangan@126.com

  • Supported by:
    National Natural Science Foundation of China(61771075);Natural Science Foundation of Shaanxi Province(2017JQ6048);Teaching Reform Research Topics in Colleges and Universities in Jiangxi Province(JXJG-22-24-6)

Abstract:

To address the problems such as difficulties in weak texture reconstruction, high resource consumption, and long reconstruction time, a multi-stage dense point cloud reconstruction network based on adaptive aggregation cyclic recursive convolution was proposed, namely A2R2-MVSNet (adaptive aggregation recurrent recursive multi view stereo net). This method first introduced a feature extraction module based on multi-scale cyclic recursive residuals to aggregate contextual semantic information, addressing the problem of difficult feature extraction in weakly textured or textureless regions. In the cost body regularization part, a residual regularization module was proposed. This module enhanced the ability of 3D CNN to extract and aggregate contextual semantics under the premise of slightly increasing memory consumption. The experimental results demonstrated that the proposed method ranked high in comprehensive metrics on the DTU dataset, showcasing superior performance in reconstructing details. Additionally, it could generate good depth maps and point cloud results on the BlendedMVS dataset. Furthermore, the network was tested for generalization on self-collected large-scale high-resolution datasets. Thanks to the coarse-to-fine multi-stage idea and our proposed module, the network could not only generate high-accuracy and complete depth maps, but also perform high-resolution reconstructions suitable for practical applications.

Key words: deep learning, computer vision, 3D reconstruction, dense reconstruction, multi-view stereo, recurrent neural network

CLC Number: