Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (3): 551-557.DOI: 10.11996/JG.j.2095-302X.2025030551

• Image Processing and Computer Vision • Previous Articles     Next Articles

The next best view navigation technology based on RGB features

ZHOU Zheng(), DAI Yaqiao, YI Renjiao, LAN Long, ZHU Chenyang()   

  1. School of Computer Science, National University of Defense Technology, Changsha Hunan 410000, China
  • Received:2024-08-23 Accepted:2025-03-03 Online:2025-06-30 Published:2025-06-13
  • Contact: ZHU Chenyang
  • About author:First author contact:

    ZHOU Zheng (1997-), master student. His main research interest covers computer graphics. E-mail:zhouzheng@nudt.edu.cn

  • Supported by:
    National Natural Science Foundation of China(62325221);National Natural Science Foundation of China(62132021);National Natural Science Foundation of China(62372457);Young Elite Scientists Sponsorship Program by CAST(2023QNRC001);Natural Science Foundation of Hunan Province of China(2021RC3071);Natural Science Foundation of Hunan Province of China(2022RC1104);NUDT Research Grants(ZK22-52);State Key Laboratory of High Performance Computing Foundation(2023KJWHPCL02)

Abstract:

Neural radiance field (NeRF) has shown excellent performance in reconstructing 3D scenes from 2D images. Using 2D images as training data, the 3D structure of scenes could be reconstructed and new views could be rendered with high quality. Although NeRF is very effective in reconstructing 3D scenes, issues of slow training speed and long inference time are encountered, and the sample quality is closely related to the quality of 3D scene reconstruction. In order to address the challenge of high-quality 3D reconstruction of NeRF under conditions of low sample quality, two sets of NeRFs with different hash codes were employed to learn the same scene and to evaluate the gap between the information gain of candidate views to guide view sampling. A new framework of Next Best View navigation technology based on RGB features was proposed. This framework exhibited strong robustness with sparse training data, was capable of capturing the next best view with high information gain through RGB feature evaluation, and optimized NeRF training, thereby improving the quality of new view synthesis with a minimal number of additional views. By optimizing the NeRF training process, the network convergence speed was increased by approximately 10 times, and the memory usage was reduced by 39.8%. A large number of experiments have verified the effectiveness and robustness of the proposed model.

Key words: neural radiance field, hash coding, sparse reconstruction, information gain, active learning

CLC Number: