Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (5): 1010-1017.DOI: 10.11996/JG.j.2095-302X.2025051010

• Image Processing and Computer Vision • Previous Articles     Next Articles

Research on UAV three-dimensional scene navigation based on deep reinforcement learning

LIU Bokai1(), YIN Xuefeng1, SUN Chuanyu1, GE Huilin2(), WEI Ziqi3, JIANG Yutong4, PIAO Haiyin5, ZHOU Dongsheng6, YANG Xin1   

  1. 1 Key Laboratory of Social Computing and Cognitive Intelligence, School of Computer Science, Dalian University of Technology, Dalian Liaoning 116024, China
    2 School of Automation, Jiangsu University of Science and Technology, Zhenjiang Jiangsu 212100, China
    3 Nstitute of Automation, Chinese Academy of Sciences, Beijing 100190, China
    4 National Key Laboratory of Advanced Off-road System Technology, China North Vehicle Research Institute, Beijing 100072, China
    5 Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Shenyang Liaoning 110035, China
    6 School of Software Engineering, Dalian University, Shenyang Liaoning 116024, China
  • Received:2024-12-17 Accepted:2025-04-21 Online:2025-10-30 Published:2025-09-10
  • Contact: GE Huilin
  • About author:First author contact:

    LIU Bokai (1999-), master student. His main research interest covers graphic image processing, etc. E-mail:lbk2593469678@163.com

  • Supported by:
    National Natural Science Foundation of China(62441216);Major Project of the Ministry of Science and Technology on “Brain Science and Brain-like Research”(2022ZD0210500)

Abstract:

In recent years, with the UAV industry and application demands expanding, the realization of UAV autonomy and intelligence has been identified as a critical challenge As a foundational technology in the field of autonomous control of UAVs, UAV navigation and exploration have become a top priority in UAV application research. Currently, most UAV navigation and exploration methods rely on the reconstruction of environmental information, consuming excessive computation and memory, thus failing to meet the increasingly complex scenarios and real-time requirements. Therefore, based on the excellent representation learning ability of deep learning and the self-learning decision-making ability of reinforcement learning, an autonomous navigation method for unmanned aerial vehicles was proposed. By continuously optimizing decision-making strategies through self-learning, the navigation task could be better completed. The method first constructed a continuous action space and a non-sparse reward function to guide the learning process of the drone; then designed feature-extraction and decision-making modules to enhance the perception and decision-making capabilities of the UAV. The experimental results demonstrated that the algorithm exhibited the best navigation and obstacle avoidance performance in the simulated 3D scene. The navigation success rate in the designed 3D scene reached 87%, a 33% increase in average cumulative reward convergence value over that of the same period method, reduced the training time, and improved training stability.

Key words: deep reinforcement learning, attention mechanism, unmanned aerial vehicle, navigation and obstacle avoidance, 3D scene

CLC Number: