欢迎访问《图学学报》 分享到:

图学学报 ›› 2024, Vol. 45 ›› Issue (1): 1-13.DOI: 10.11996/JG.j.2095-302X.2024010001

• 综述 • 上一篇    下一篇

神经辐射场加速算法综述

王稚儒1(), 常远2, 鲁鹏3, 潘成伟1()   

  1. 1.北京航空航天大学人工智能研究院,北京 100191
    2.中国电信股份有限公司研究院,北京 102209
    3.北京邮电大学人工智能学院,北京 100876
  • 收稿日期:2023-09-26 接受日期:2023-12-11 出版日期:2024-02-29 发布日期:2024-02-29
  • 通讯作者: 潘成伟(1989-),男,副教授,博士。主要研究方向为计算机图形学、计算机视觉等。E-mail:pancw@buaa.edu.cn
  • 作者简介:

    王稚儒(2001-),男,硕士研究生。主要研究方向为计算机图形学与深度学习。E-mail:19241085@buaa.edu.cn

  • 基金资助:
    新一代人工智能国家科技重大专项(2022ZD0116401)

A review on neural radiance fields acceleration

WANG Zhiru1(), CHANG Yuan2, LU Peng3, PAN Chengwei1()   

  1. 1. Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
    2. China Telecom Research Institute, Beijing 102209, China
    3. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2023-09-26 Accepted:2023-12-11 Online:2024-02-29 Published:2024-02-29
  • Contact: PAN Chengwei (1989-), associate professor, Ph.D. His main research interests cover computer graphics, computer vision, etc. E-mail:pancw@buaa.edu.cn
  • About author:

    WANG Zhiru (2001-), master student. His main research interests cover computer graphics and deep learning. E-mail:19241085@buaa.edu.cn

  • Supported by:
    National Science and Technology Major Project(2022ZD0116401)

摘要:

近年来,神经辐射场(NeRF)已成为计算机图形学和计算机视觉领域中一个重要的研究方向,因其高逼真的视觉合成效果,在真实感渲染、虚拟现实、人体建模、城市地图等领域得到了广泛的应用。NeRF利用神经网络从输入图片集中学习三维场景的隐式表征,并合成高逼真的新视角图像。然而原始NeRF模型的训练和推理速度都很慢,难以在真实环境下部署与应用。针对NeRF的加速问题,研究者们从场景建模方法、光线采样策略等方面展开对NeRF进行提速的研究。该类工作大致可分为以下研究方向:烘焙模型、与离散表示方法结合、提高采样效率、利用哈希编码降低MLP网络复杂度、引入场景泛化性、引入深度监督信息和分解方法。通过介绍NeRF模型提出的背景,对上述思路的代表方法的优势与特点进行了讨论和分析,最后总结了NeRF相关工作在加速方面所取得的进展和对于未来的展望。

北京航空航天大学潘成伟副教授及其学生王稚儒等回顾了国内外近年来神经辐射场加速方面的有关研究,并总结归类了以下加速思路:烘焙模型、与离散表示方法结合、提高采样效率、利用哈希编码降低MLP网络复杂度、引入场景泛化性、引入深度监督信息和分解方法。然后对上述各思路的代表方法进行了介绍和分析,同时比较和讨论了各方法的优势与特点。最后对当前神经辐射场在加速方面取得的进展进行总结并对未来的发展进行展望,为该领域更深入的探索提供参考。

关键词: 神经辐射场, 视点合成, 神经渲染, NeRF加速, 深度学习

Abstract:

Neural radiance field (NeRF) has become an important research area in computer graphics and computer vision in recent years. Due to its highly realistic visual synthesis effects, NeRF has been widely used in photorealistic rendering, virtual reality, human body modeling, urban mapping, and other domains. NeRF employs neural networks to learn implicit representations of 3D scenes from input image sets and to synthesize highly realistic novel view images. However, the training and inference speeds of the primitive NeRF model are very slow, posing challenges for real-condition deployment and application. To address the acceleration problem of NeRF, researchers have studied the acceleration of NeRF from the aspects of scene modeling methods and ray sampling strategies. Those works can be categorized into the following research directions: baking model, integrating models with discrete representation methods, enhancing sampling efficiency, using hash coding to reduce the complexity of MLP network, introducing scene generalization, and introducing deep supervision information and field decomposition methods. After introducing the background of the NeRF model, the advantages and characteristics of the representative methods of the above ideas were discussed and analyzed. Finally, the progress made in the acceleration of NeRF-related work and future prospects were summarized.

Key words: neural radiance field, view synthesis, neural rendering, NeRF acceleration, deep learning

中图分类号: