面向移动增强现实的实时深度学习目标检测方法综述

doi:10.11996/JG.j.2095-302X.2021040525

图学学报 ›› 2021, Vol. 42 ›› Issue (4): 525-534.DOI: 10.11996/JG.j.2095-302X.2021040525

面向移动增强现实的实时深度学习目标检测方法综述

1. 北京理工大学光电学院，北京 100081； 2. 北京电影学院未来影像高精尖创新中心，北京 100088

出版日期:2021-08-31 发布日期:2021-08-05
基金资助:
国家自然科学基金项目(61960206007)；广东省重点领域研发计划项目(2019B010149001)；高等学校学科创新引智计划项目(B18005)

Review of real-time deep learning-based object detection for mobile augmented reality

1. School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China;
2. Advanced Innovation Center for Future Visual Entertainment, Beijing Film Academy, Beijing 100088, China

Online:2021-08-31 Published:2021-08-05
Supported by:
National Natural Science Foundation of China (61960206007); R & D Projects in Key Areas of Guangdong (2019B010149001);
Programme of Introducing Talents of Discipline to Universities (B18005)

摘要/Abstract

摘要： 移动增强现实(AR)借助智能移动终端将虚拟信息和真实世界进行实时融合，能否实时准确地对
环境中需要增强的物体进行目标检测直接决定了系统的性能。随着深度学习的快速发展，近年来出现了大量的
基于深度学习的目标检测方法。由于存在移动增强设备计算能力有限、能耗大、模型尺寸大以及卸载任务到边
缘云端的网络延迟严重等问题，将深度学习方法应用于移动 AR 的目标检测是一项具有挑战性的问题。首先从
Two stage 和 One stage 的 2 方面对目前深度学习目标检测算法进行综述；然后对面向移动 AR 的目标检测系统
架构进行归纳分类，分析了基于本地端、云端或边缘端和协作式的移动 AR 目标检测系统并总结了各自的优势
和局限性；最后对移动 AR 中目标检测亟待解决的问题和未来发展方向进行了展望和预测。

关键词: 目标检测, 移动增强现实, 深度学习, 计算机视觉, 移动边缘计算

Abstract: Mobile augmented reality (AR) is a technology that integrates virtual information with the real world on the
mobile intelligent terminal, therefore the ability to accurately detect the to-be-enhanced objects in the environment
directly determines the performance of mobile AR systems. With the rapid advancement of deep learning, a large
number of deep learning-based methods have been proposed for better detection. However, such problems as limited
computing power, high energy consumption, large model size, and offloading latency make it difficult to combine
deep learning-based object detection with mobile AR. This paper first summarized previous studies on deep
learning-based object detection from both aspects of two stages and one stage, then categorized the object detection
systems for mobile AR, and analyzed the approaches based on local, cloud, or edge ends, as well as collaboration.
Finally, both the advantages and limitations of these methods were summarized, and predictions were made on the problems to be solved and the future development of object detection in mobile AR.

Key words: object detection, mobile augmented reality, deep learning, computer vision, mobile edge computing

中图分类号:

TP 391

高文婷, 刘越. 面向移动增强现实的实时深度学习目标检测方法综述[J]. 图学学报, 2021, 42(4): 525-534.

GAO Wen-ting , LIU Yue. Review of real-time deep learning-based object detection for mobile augmented reality[J]. Journal of Graphics, 2021, 42(4): 525-534.

[1]	东辉, 陈鑫凯, 孙浩, 姚立纲. 基于改进 YOLOv4 和图像处理的蔬菜田杂草检测[J]. 图学学报, 2022, 43(4): 559-569.
[2]	廖仕敏, 刘仰川, 朱叶晨, 王艳玲, 高欣 . 一种基于 CycleGAN 改进的低剂量 CT 图像增强网络[J]. 图学学报, 2022, 43(4): 570-578.
[3]	张盾, 黄志开, 王欢, 吴义鹏, 王颖, 邹家豪. 基于多尺度特征实现超参进化的野生菌分类研究与应用[J]. 图学学报, 2022, 43(4): 580-589.
[4]	梁振宇, 华嘉皓, 陈浩龙, 邓逸川. 基于计算机视觉的建筑施工期临时结构损伤识别方法 [J]. 图学学报, 2022, 43(4): 608-615.
[5]	刘南杉, 裴云强, 蒋皓, 韩永国, 吴亚东, 王赋攀, 易思恒. 基于VD-MobileNet 网络的 WebAR生活垃圾分类信息可视化方法[J]. 图学学报, 2022, 43(4): 667-676.
[6]	熊琛, 陈立斌, 李林泽, 许镇, 赵杨平. 基于计算机视觉与 BIM 的裂缝可视化管理方法[J]. 图学学报, 2022, 43(4): 721-728.
[7]	范新南, 黄伟盛, 史朋飞, 辛元雪, 朱凤婷, 周润康. 基于改进 YOLOv4 的嵌入式变电站仪表检测算法[J]. 图学学报, 2022, 43(3): 396-403.
[8]	李华恩, 赵洋, 陈缘, 张效娟. 基于递归对齐网络的黑白老卡通高清重制[J]. 图学学报, 2022, 43(3): 434-442.
[9]	姜柳, 史健勇, 付功义, 潘泽宇, 王朝宇. 基于 BIM 和深度学习的建筑平面凹凸不规则识别[J]. 图学学报, 2022, 43(3): 522-529.
[10]	林佳瑞, 程志刚, 韩宇, 尹云鹏. 基于 BERT 预训练模型的灾害推文分类方法[J]. 图学学报, 2022, 43(3): 530-536.
[11]	姜莱, 于震, 王鹏飞, 周东生, 侯亚庆 . 音频驱动跨模态视觉生成算法综述[J]. 图学学报, 2022, 43(2): 181-188.
[12]	高铭, 张荷花, 张庭瑞, 张轩铭. 基于深度学习的公共建筑像素施工图空间识别[J]. 图学学报, 2022, 43(2): 189-196.
[13]	胡俊, 顾晶晶, 王秋红. 基于遥感图像的多模态小目标检测[J]. 图学学报, 2022, 43(2): 197-204.
[14]	廖志伟, 金兢, 张超凡, 杨学志. 基于分层压缩激励的 ASPP 网络单目深度估计[J]. 图学学报, 2022, 43(2): 214-222.
[15]	李妮妮, 王夏黎, 付阳阳, 郑凤仙, 何丹丹, 袁绍欣. 一种优化 YOLO 模型的交通警察目标检测方法[J]. 图学学报, 2022, 43(2): 296-305.

面向移动增强现实的实时深度学习目标检测方法综述

Review of real-time deep learning-based object detection for mobile augmented reality

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价