欢迎访问《图学学报》 分享到:

图学学报 ›› 2026, Vol. 47 ›› Issue (1): 99-110.DOI: 10.11996/JG.j.2095-302X.2026010099

• 图像处理与计算机视觉 • 上一篇    下一篇

BSD-YOLO:基于动态稀疏注意力与自适应检测头的小目标车辆检测方法

杨彪, 王学, 官铮(), 龙萍   

  1. 云南大学信息学院云南 昆明 650504
  • 收稿日期:2025-06-16 接受日期:2025-08-18 出版日期:2026-02-28 发布日期:2026-03-16
  • 通讯作者:官铮,E-mail:guanzheng@ynu.edu.cn
  • 基金资助:
    国家自然科学基金(61761045);云南专家工作站项目(202305AF150045)

BSD-YOLO: a small target vehicle detection method based on dynamic sparse attention and adaptive detection head

YANG Biao, WANG Xue, GUAN Zheng(), LONG Ping   

  1. School of Information Science and Engineering, Yunnan University, Kunming Yunnan 650504, China
  • Received:2025-06-16 Accepted:2025-08-18 Published:2026-02-28 Online:2026-03-16
  • Supported by:
    National Natural Science Foundation of China(61761045);Yunnan Expert Workstation(202305AF150045)

摘要:

在智能交通监控系统中,复杂场景下的小目标车辆检测面临特征分辨率低、遮挡干扰严重、模型计算冗余及边界框回归精度不足等挑战。为兼顾检测精度与边缘设备部署效率,提出一种基于动态稀疏注意力与轻量化双分支结构的改进YOLOv8检测框架。首先设计双向路由稀疏注意力机制(ReBiAttention),通过双层动态路由筛选关键特征,增强对小目标浅层特征的保留能力;随后结合GSConv与VoV-GSCSP模块,在减小计算量的同时动态调整多尺度特征权重;并在检测头部分引入改进型DynamicHead结构,实现多任务自适应优化;最后改进ShapeIoU损失函数,引入形状与尺度感知机制,提升定位精度。在UA-DETRAC数据集上的实验表明,改进模型较基线YOLOv8n的Precision,Recall与mAP@0.5分别提升8.739%,1.685%和7.225%,参数量减少4.3%。该方法为复杂交通场景下的小目标车辆精准检测提供了高效解决方案。

关键词: YOLOv8, 注意力机制, 轻量化, 深度学习, 小目标检测

Abstract:

In intelligent traffic monitoring systems, small target vehicle detection in complex scenes faces challenges such as low feature resolution, severe occlusion interference, computational redundancy, and insufficient bounding-box regression accuracy. To balance detection accuracy with deployment efficiency on edge devices, an improved YOLOv8 framework based on dynamic sparse attention and a lightweight dual-branch structure was proposed. The method first introduced a bidirectional routing sparse attention mechanism (ReBiAttention) that enhanced the retention of shallow features for small targets by dynamically filtering key features through a two-level routing strategy. Subsequently, GSConv and VoV-GSCSP modules were integrated to reduce computational cost while dynamically adjusting multi-scale feature weights. An improved DynamicHead was applied for multi-task adaptive optimization, and a modified ShapeIoU loss function with shape- and scale-aware weighting was employed to improve localization accuracy. Experiments on the UA-DETRAC dataset showed that, relative to baseline YOLOv8n, Precision, Recall, and mAP@0.5 increased by 8.739%, 1.685%, and 7.225%, respectively, while the parameter count decreased by 4.3%. This method provided an efficient solution for accurate detection of small-target vehicles in complex traffic scenarios.

Key words: YOLOv8, sparse attention, lightweight, deep learning, small target detection

中图分类号: