欢迎访问《图学学报》 分享到:

图学学报 ›› 2024, Vol. 45 ›› Issue (6): 1328-1337.DOI: 10.11996/JG.j.2095-302X.2024061328

• 图像处理与计算机视觉 • 上一篇    下一篇

基于YOLOv8的轻量化无人机图像目标检测算法

闫建红(), 冉同霄   

  1. 太原师范学院计算机科学与技术学院,山西 晋中 030619
  • 收稿日期:2024-06-13 接受日期:2024-09-26 出版日期:2024-12-31 发布日期:2024-12-24
  • 第一作者:闫建红(1972-),女,教授,博士。主要研究方向为机器学习、计算机视觉。E-mail:yan_jian_hong@163.com
  • 基金资助:
    山西省重点研发计划项目(202102010101008);山西省研究生精品教学案例项目(2024AL27)

Lightweight UAV image target detection algorithm based on YOLOv8

YAN Jianhong(), RAN Tongxiao   

  1. School of Computer Science and Technology, Taiyuan Normal University, Jinzhong Shanxi 030619, China
  • Received:2024-06-13 Accepted:2024-09-26 Published:2024-12-31 Online:2024-12-24
  • First author:YAN Jianhong (1972-), professor, Ph.D. Her main research interests cover machine learning, computer vision. E-mail:yan_jian_hong@163.com
  • Supported by:
    Shanxi Province Key Research and Development Project(202102010101008);Shanxi Province Graduate Excellence Teaching Case Project(2024AL27)

摘要:

针对无人机图像目标像素低、背景复杂、模型部署难等问题,提出一种基于YOLOv8的轻量级多尺度特征融合小目标检测算法。为了降低网络参数量,提高模型检测速度,使用fasternet block替换C2f的bottleneck,构建轻量化特征提取模块FasterC2f;为了增强模型多尺度特征融合能力,设计全新的聚焦扩散特征金字塔结构,使颈部网络每层特征图都聚焦三层特征信息;设计共享卷积检测头,在优化模型参数量的同时,让每个检测头都包含不同尺度特征信息;重构小目标检测网络,采用更大尺度的三层检测头,提高模型对小目标的特征学习能力。在Visdrone数据集上的实验结果表明,与YOLOv8s相比,该模型的精确率、召回率和mAP分别提高了5.1%,5.4%和6.6%,参数量降低了68%,模型文件体积减少了15.3 MB,FPS提高了16%,表明该模型具有检测精度高、检测速度快、模型易部署等优点。

关键词: YOLOv8, 无人机, 小目标检测, 轻量化, 特征融合

Abstract:

To address the problems of low target pixels, complex backgrounds, and difficult model deployment in unmanned aerial vehicle (UAV) images, a lightweight multi-scale feature fusion small target detection algorithm based on YOLOv8 was proposed. In order to reduce the number of network parameters and improve the model detection speed, the FasterNet block was used to replace the bottleneck of C2f, resulting in the construction of a lightweight feature extraction module, FasterC2f. To enhance the multi-scale feature fusion ability of the model, a new focus diffusion feature was designed that enables each layer feature map of the neck network to focus on three layers of feature information. A shared convolution detection head was designed, allowing each detection head to contain feature information from different scales while optimizing the model parameters. The small target detection network was reconstructed to utilize a larger-scale three-layer detection head, improving the model’s feature learning capability for small targets. Experimental results on the Visdrone data set indicated that compared with YOLOv8s, the precision rate, recall rate, and mAP of this model increased by 5.1%, 5.4%, and 6.6%, respectively. The number of parameters was reduced by 68%, and the model file size decreased by 15.3 MB, while FPS increased by 16%. These results demonstrated that the model possesses advantages in high detection accuracy, fast detection speed, and ease of deployment.

Key words: YOLOv8, UAV, small target detection, lightweight, feature fusion

中图分类号: