Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (3): 502-509.DOI: 10.11996/JG.j.2095-302X.2025030502

• Image Processing and Computer Vision • Previous Articles     Next Articles

Large scene reconstruction method based on voxel grid feature of NeRF

WANG Daolei(), DING Zijian, YANG Jun, ZHENG Shaokai, ZHU Rui, ZHAO Wenbin()   

  1. College of Energy and Mechanical Engineering, Shanghai University of Electric Power, Shanghai 201306, China
  • Received:2024-09-17 Accepted:2024-12-16 Online:2025-06-30 Published:2025-06-13
  • Contact: ZHAO Wenbin
  • About author:First author contact:

    WANG Daolei (1981-), professor, Ph.D. His main research interests cover computer vision, image processing, CAD/CAM. E-mail:alfredwdl@shiep.edu.cn

Abstract:

To address the problems of blurred rendering and missing details problems in neural radiation fields for large scenes, a rendering method suitable for large scenes was proposed that was guided by voxel mesh features and driven by ray sampling. This method can effectively enhance the accuracy of 3D models, which was particularly crucial for large-scale scene reconstruction and can be applicable to various scenarios such as architectural design and urban planning. Firstly, grid processing was performed on the reconstructed scene by allocating scene boundaries based on scene size and refining voxel units. Secondly, tensor decomposition was conducted on the information contained in the voxels, and gridded scene features were extracted. Neural radiance fields then focused on sampling based on the extracted features. Finally, the sampling results were fed into a neural network, and a Multilayer Perceptron renderer converted the features into color and density information, synthesizing view rendering results from various new perspectives. Multiple datasets were used for validation in the experiment. The experimental results demonstrated that, compared with other methods, the proposed approach achieved an average improvement of approximately 11% in PSNR, an average increase of about 12% in SSIM, and an average reduction of around 15% in LPIPS, with significantly enhanced visual effects.

Key words: neural radiation fields, large scene, 3D reconstruction, deep learning, image rendering

CLC Number: