基于透视降采样和神经网络的地面标志检测

doi:10.11996/JG.j.2095-302X.2022020288

图学学报 ›› 2022, Vol. 43 ›› Issue (2): 288-295.DOI: 10.11996/JG.j.2095-302X.2022020288

• 图像处理与计算机视觉 • 上一篇下一篇

基于透视降采样和神经网络的地面标志检测

1. 山东大学信息科学与工程学院，山东青岛 266237；
2. 山东高速信息集团有限公司，山东济南 250000

出版日期:2022-04-30 发布日期:2022-05-07
基金资助:
山东省科技发展计划重点项目(2019GGX101018)；

山东省自然科学基金项目(ZR2017MF057)

Landmark detection based on perspective down-sampling and neural network

1. School of Information Science and Engineering, Shandong University, Qingdao Shandong 266237, China;
2. Shandong Hi-Speed Information Group Co, Ltd, Jinan Shandong 250000, China

Online:2022-04-30 Published:2022-05-07
Supported by:
Key Projects of Science and Technology Development Plan of Shandong Province(2019GGX101018);

National Natural Science Foundation of Shandong (ZR2017MF057)

摘要/Abstract

摘要： 在智能驾驶领域，为实时精确检测路面的导向标志，提出一种基于透视降采样和神经网络的地标
检测方法，有效解决传统检测方法实时性较差、复杂场景和远处小目标检测准确率较低的问题。首先，选取图像
感兴趣区域进行透视降采样，降低道路图像近处分辨率，缩小图像尺寸，同时消除透视投影误差。其次对
YOLOv3-tiny 目标检测网络进行改进，采用 k-means++算法对自建数据集的边界框聚类；添加卷积层强化浅层特
征，提升小目标表征能力；改变特征金字塔融合尺度，将预测输出调整为适合地标尺寸的 26×26 和 52×52。最后，
在自建多场景数据集上测试，准确率由 78%提升到 99%，模型大小由 33.8 MB 减小为 8.3 MB。结果表明，基于
透视降采样和神经网络的地标检测方法鲁棒性强，对小目标检测精度更高，易于在低端嵌入式设备上部署。

关键词: 透视降采样, YOLOv3-tiny, 地标检测, 数据集, k-means++

Abstract: In the field of intelligent driving, a neural network-based and perspective down-sampling-based landmark
detection method was proposed to accurately detect the road guide signs in real time. This proposed method can
effectively solve the problems of poor real-time performance of traditional detection methods and low detection
accuracy for complex scenes and remote small targets. Firstly, the region of interest for the image was selected for
perspective down-sampling to reduce the near resolution of the road image, reduce the image size, and eliminate the
perspective projection error. Secondly, the YOLOv3-tiny target detection network was enhanced. The boundary frame
clustering of self-built data set was implemented by k-means++. The convolution layer was added to strengthen the
shallow features and enhance the small target representation ability. By changing the fusion scale of feature pyramid,
the prediction output was adjusted to 26×26 and 52×52. Finally, the accuracy rate was elevated from 78% to 99% on
the self-built multi-scene data set, and the model size was reduced from 33.8 MB to 8.3 MB. The results show that a neural network-based and perspective down-sampling-based landmark detection method displays strong robustness,
higher detection accuracy for small targets, and is readily deployable on low-end embedded devices.

Key words: perspective down-sampling, YOLOv3-tiny, landmark detection, data set, k-means++

中图分类号:

TP 391

李玉珍, 陈辉, 王杰, 荣文 . 基于透视降采样和神经网络的地面标志检测[J]. 图学学报, 2022, 43(2): 288-295.

LI Yu-zhen, CHEN Hui, WANG Jie, RONG Wen. Landmark detection based on perspective down-sampling and neural network[J]. Journal of Graphics, 2022, 43(2): 288-295.

[1]	胡俊, 顾晶晶, 王秋红. 基于遥感图像的多模态小目标检测[J]. 图学学报, 2022, 43(2): 197-204.
[2]	李妮妮, 王夏黎, 付阳阳, 郑凤仙, 何丹丹, 袁绍欣. 一种优化 YOLO 模型的交通警察目标检测方法[J]. 图学学报, 2022, 43(2): 296-305.
[3]	蒋镕圻, 彭月平, 谢文宣, 谢郭蓉. 嵌入 scSE 模块的改进 YOLOv4 小目标检测算法[J]. 图学学报, 2021, 42(4): 546-555.
[4]	黄凯奇, 赵鑫, 李乔哲, 胡世宇 . 视觉图灵：从人机对抗看计算机视觉下一步发展[J]. 图学学报, 2021, 42(3): 339-348.
[5]	叶泽聪 , 高志强 , 崔翛龙 , 蒋镕圻 , . 基于模型压缩的 YOLOV3 实时枪支识别方法[J]. 图学学报, 2021, 42(2): 198-205.
[6]	张胜虎, 马惠敏 . 遮挡对于目标检测的影响分析[J]. 图学学报, 2020, 41(6): 891-896.
[7]	翟晓卉，史健勇. BIM 和GIS 的空间语义数据集成方法及应用研究[J]. 图学学报, 2020, 41(1): 148-157.

基于透视降采样和神经网络的地面标志检测

Landmark detection based on perspective down-sampling and neural network

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 7

编辑推荐

Metrics

本文评价