欢迎访问《图学学报》 分享到:

图学学报 ›› 2022, Vol. 43 ›› Issue (2): 296-305.DOI: 10.11996/JG.j.2095-302X.2022020296

• 图像处理与计算机视觉 • 上一篇    下一篇

一种优化 YOLO 模型的交通警察目标检测方法

  

  1. 长安大学信息工程学院,陕西 西安 710064
  • 出版日期:2022-04-30 发布日期:2022-05-07
  • 基金资助:
    国家重点研发计划项目(2020YFB1600400)

A traffic police object detection method based on optimized YOLO model

  1. School of Information Engineering, Chang’an University, Xi’an Shaanxi 710064, China
  • Online:2022-04-30 Published:2022-05-07
  • Supported by:
    National Key R&D Program of China (2020YFB1600400)

摘要: 针对复杂交通场景中交通警察目标检测与定位准确率低的问题,提出一种优化 YOLOv4 模型的
交通警察目标检测方法。首先,采用 4 种随机转换方式对自建的交通警察数据集进行扩充,解决了模型过拟合
问题并提高模型的泛化能力;其次,将 YOLOv4 主干网络替换为 MobileNet 并引入 Inception-Resnet-v1 结构,
有效地减少了参数总量并加深了网络层数;然后,使用 K-means++聚类算法对自建数据集进行聚类分析以重新
定义网络的初始候选框,提高了交通警察目标深度特征的学习效率;最后,引入焦点损失函数以优化分类损失
函数,解决了训练中正负样本数量不平衡问题。研究结果表明,优化后的 YOLOv4 模型大小仅 50 M,AP 值达
98.01%,与 Faster R-CNN,YOLOv3 和原始 YOLOv4 模型相比均有提升。有效解决了目前复杂交通场景中交
通警察目标的漏检、误检及检测精度低等问题。

关键词: 交通警察目标检测, YOLOv4 模型, K-means++聚类算法, 深度特征学习, 焦点损失函数

Abstract: To tackle the problems of low accuracy of detection and localization for traffic police object in complex
traffic scenes, a method to detect traffic police object based on the optimized YOLOv4 model was proposed in this
study. Firstly, four random transformation methods were employed to expand the self-built traffic police data set, so as
to solve the problem of model over-fitting and improve the generalization ability of the network model. Secondly, the
YOLOv4 backbone network was replaced with the lightweight MobileNet. The Inception-Resnet-v1 structure was
introduced to reduce the parameter numbers and deepen the network layers of the model effectively. Then, the
K-means++ clustering algorithm was adopted to perform clustering analysis on the self-built data set. In doing so, the
initial candidate frame of the network was redefined, and the learning efficiency was improved for traffic police object
depth features. Finally, to address the problem of the imbalance of positive and negative samples in the process of
network training, the focus loss function was introduced to optimize the classification loss function. Experimental
results demonstrate that the size of the optimized YOLOv4 model is only 50 M and the AP value reaches up to 98.01%.
compared with Faster R-CNN, YOLOv3, and the original YOLOv4 model, the optimized network has been
significantly improved. The proposed method can effectively solve the problems of missed detection, false detection, and low accuracy for traffic police object in current complex traffic scenes.

Key words: traffic police object detection, YOLOv4 model, K-means++clustering algorithm, deep feature learning,
focus loss function

中图分类号: