欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (5): 997-1012.DOI: 10.11996/JG.j.2095-302X.2023050997

• 计算机图形学与虚拟现实 • 上一篇    下一篇

一种基于简易标记点编码的光学跟踪系统

韩兆阳1,2(), 翁冬冬1,2, 郭署山3, 贺文杰1,2, 江海燕1,2, 李冬1,2()   

  1. 1.北京市混合现实与新型显示工程技术研究中心,北京 100081
    2.北京理工大学光电学院,北京 100081
    3.北京临近空间飞艇技术开发有限公司,北京 100070
  • 收稿日期:2023-02-12 接受日期:2023-06-06 出版日期:2023-10-31 发布日期:2023-10-31
  • 通讯作者: 李冬(1986-),男,助理研究员,博士。主要研究方向为虚拟现实、增强现实和人机交互。E-mail:drli@bit.edu.cn
  • 作者简介:韩兆阳(1998-),男,硕士研究生。主要研究方向为虚拟现实、增强现实和人机交互。E-mail:hzy3_3@163.com
  • 基金资助:
    国家国防科技工业局基础科研项目(JCKY2019205A004);国家自然科学基金项目(62072036)

An optical tracking system based on simple marker encoding

HAN Zhao-yang1,2(), WENG Dong-dong1,2, GUO Shu-shan3, HE Wen-jie1,2, JIANG Hai-yan1,2, LI Dong1,2()   

  1. 1. Beijing Engineering Research Center of Mixed Reality and Advanced Display, Beijing 100081 China
    2. School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081 China
    3. Beijing Near Space Airship Technology Development Co, Ltd, Beijing 100070 China
  • Received:2023-02-12 Accepted:2023-06-06 Online:2023-10-31 Published:2023-10-31
  • Contact: LI Dong (1986-), assistant researcher, Ph.D. His main research interests cover visual reality, augmented reality and human-computer interaction. E-mail:drli@bit.edu.cn
  • About author:HAN Zhao-yang (1998-), master student. His main research interests cover visual reality, augmented reality and human-computer interaction. E-mail:hzy3_3@163.com
  • Supported by:
    Basic Research Project of State Administration of Science(JCKY2019205A004);National Natural Science Foundation of China(62072036)

摘要:

在虚拟现实,增强现实或是混合现实应用中,实时获取用户和交互对象的位姿是构建高沉浸感虚拟环境的前提条件。随着虚拟现实技术的不断发展,用户对虚拟环境中运动范围的要求不断提高,不再仅满足于单个房间内的小范围移动,而希望能在更大范围进行漫游和交互。为此,提出了一种光学定位追踪系统,通过在天花板或地面布置少量的红外LED标记点来实现精确的相机三维追踪。所述跟踪系统使用了最基本的点、线元素构建标记点图案,通过设计该标记点图案的编码原则、布局重复特征检索方法和对应点匹配算法,实现了标记点图像的快速、精确解析。实验证明系统计算的位置精度可以控制在毫米级别范围内,同时在对抗标记点抖动和遮挡等方面保持了较高的识别准确率。由此实现的跟踪系统具有低成本、易拓展和抗遮挡等特点,可以满足百平方米级别范围的实时追踪定位需求。

关键词: 光学追踪, 标记点编码, 图像处理, 定位导航, 虚拟现实系统

Abstract:

In visual, augmented, or mixed reality applications, real-time acquisition of user and object poses is a prerequisite for building a highly immersive virtual environment. With the continuous development of virtual reality technology, users’ demands for the range of motion in virtual environments have been increasing. They are no longer content with limited movement within the confined space of a single room; instead, they seek to roam and interact in a larger range of environments. Most of the tracking systems used by popular AR/VR devices today are designed for room-level or even smaller range tracking. When larger range tracking is required, these systems either introduce greater error drift or require more hardware to be arranged in the room to cover a larger area (e.g. Light House), which creates huge hardware costs and a complex configuration process, making them not suitable for general and personal use. To address this, a system for optical positioning and tracking was proposed, which could achieve accurate 3D camera tracking by deploying a small number of infrared LED markers on the ceiling or floor. The proposed tracking system utilized the most basic dot and line elements to build the landmark pattern. Compared with traditional marker-based systems, individual dots do not contain any information and are identified only after they are formed into a basic graphic element with a line next to them. The straight line segments exist to increase the redundancy of the basic graph elements, thus avoiding the situation where the dots are obscured and cannot be recognized. By designing the encoding principle of the marker patterns, employing the layout repeated feature retrieval method, and implementing the corresponding points matching algorithms, the fast and accurate decoding of the landmark images was realized. Experiments have proven that the system could achieve the position accuracy at the millimeter level. In robustness experiments, the proposed method could maintain higher recognition accuracy even in the presence of challenges such as large inclination angles and marker point occlusion. These measurements show the potential of our system to cope with more extreme situations. We also count the processing time of the system, and the average latency of our method is 4.34 ms, which indicates that performing sparse graph element layout and simplifying marker point decoding effectively reduces the system computation time. The resulting tracking system possesses characteristics such as low cost, easy scalability, and resilience to occlusion, thereby meeting the demand for real-time tracking and positioning at the 100 square meter level.

Key words: optical tracking, marker encoding, image processing, positioning and navigation, VR system

中图分类号: