欢迎访问《图学学报》 分享到:

图学学报 ›› 2021, Vol. 42 ›› Issue (4): 525-534.DOI: 10.11996/JG.j.2095-302X.2021040525

• 综述 • 上一篇    下一篇

面向移动增强现实的实时深度学习目标检测方法综述

  

  1. 1. 北京理工大学光电学院,北京 100081; 2. 北京电影学院未来影像高精尖创新中心,北京 100088
  • 出版日期:2021-08-31 发布日期:2021-08-05
  • 基金资助:
    国家自然科学基金项目(61960206007);广东省重点领域研发计划项目(2019B010149001);高等学校学科创新引智计划项目(B18005)

Review of real-time deep learning-based object detection for mobile augmented reality

  1. 1. School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China;
    2. Advanced Innovation Center for Future Visual Entertainment, Beijing Film Academy, Beijing 100088, China
  • Online:2021-08-31 Published:2021-08-05
  • Supported by:
    National Natural Science Foundation of China (61960206007); R & D Projects in Key Areas of Guangdong (2019B010149001);
    Programme of Introducing Talents of Discipline to Universities (B18005)

摘要: 移动增强现实(AR)借助智能移动终端将虚拟信息和真实世界进行实时融合,能否实时准确地对
环境中需要增强的物体进行目标检测直接决定了系统的性能。随着深度学习的快速发展,近年来出现了大量的
基于深度学习的目标检测方法。由于存在移动增强设备计算能力有限、能耗大、模型尺寸大以及卸载任务到边
缘云端的网络延迟严重等问题,将深度学习方法应用于移动 AR 的目标检测是一项具有挑战性的问题。首先从
Two stage 和 One stage 的 2 方面对目前深度学习目标检测算法进行综述;然后对面向移动 AR 的目标检测系统
架构进行归纳分类,分析了基于本地端、云端或边缘端和协作式的移动 AR 目标检测系统并总结了各自的优势
和局限性;最后对移动 AR 中目标检测亟待解决的问题和未来发展方向进行了展望和预测。

关键词: 目标检测, 移动增强现实, 深度学习, 计算机视觉, 移动边缘计算

Abstract: Mobile augmented reality (AR) is a technology that integrates virtual information with the real world on the
mobile intelligent terminal, therefore the ability to accurately detect the to-be-enhanced objects in the environment
directly determines the performance of mobile AR systems. With the rapid advancement of deep learning, a large
number of deep learning-based methods have been proposed for better detection. However, such problems as limited
computing power, high energy consumption, large model size, and offloading latency make it difficult to combine
deep learning-based object detection with mobile AR. This paper first summarized previous studies on deep
learning-based object detection from both aspects of two stages and one stage, then categorized the object detection
systems for mobile AR, and analyzed the approaches based on local, cloud, or edge ends, as well as collaboration.
Finally, both the advantages and limitations of these methods were summarized, and predictions were made on the problems to be solved and the future development of object detection in mobile AR.

Key words: object detection, mobile augmented reality, deep learning, computer vision, mobile edge computing

中图分类号: