Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (5): 890-898.DOI: 10.11996/JG.j.2095-302X.2023050890

• Image Processing and Computer Vision • Previous Articles     Next Articles

A dense pedestrian detection algorithm with improved YOLOv8

GAO Ang1(), LIANG Xing-zhu1,2(), XIA Chen-xing1, ZHANG Chun-jiong3   

  1. 1. School of Computer Science and Engineering, Anhui University of Science and Technology, Huainan Anhui 232001, China
    2. Institute of Environment-friendly Materials and Occupational Health, Anhui University of Science and Technology, Wuhu Anhui 241003, China
    3. College of Electronics and Information Engineering, Tongji University, Shanghai 201804, China
  • Received:2023-05-15 Accepted:2023-07-24 Online:2023-10-31 Published:2023-10-31
  • Contact: LIANG Xing-zhu (1979-), associate professor, master. His main research interests cover pattern recognition, computer vision, etc. E-mail:xzliang@aust.edu.cn
  • About author:GAO Ang (1999-), master student. His main research interests cover object detection and image processing. E-mail:2021201221@aust.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62102003);Research Foundation of the Institute of Environment-Friendly Materials and Occupational Health (Wuhu), Anhui University of Science and Technology(ALW2021YF04);Science and Technology Research Project of Wuhu City(2020yf48)

Abstract:

In response to the challenge of detecting small-scale, occluded pedestrians in dense scenes, where they are prone to being missed, we proposed an improved YOLOv8 detection algorithm. First, to address the issue of extracting features from small-scale pedestrians, a backbone network improved by deformable convolution was employed to enhance the feature extraction capability of the network, and an occlusion-aware attention mechanism was designed to enhance the visible part of the occluded pedestrian features. Second, to address imprecise localization of the detection head in dense pedestrian scenes, a dynamic decoupling head was designed to enhance attention to multi-scale pedestrian features, thereby improving the expression capability of the detection head. Finally, to address the problem of low model training efficiency, the regression loss that combined Wise-IoU with distributed focus loss was utilized for training, thereby enhancing the convergence ability of the model. Through the analysis of experimental results, the improved YOLOv8 algorithm demonstrated exceptional performance on two challenging and dense pedestrian datasets, namely CrowdHuman and WiderPerson, achieving an AP50 of 90.6% and 92.3% and an AP50:95 of 62.5% and 68.2%, respectively. In contrast to the original algorithm, the improvements were substantial, establishing robust competitiveness when compared with other state-of-the-art pedestrian detection models. The proposed algorithm exhibited a wide range of applications in dense pedestrian detection tasks.

Key words: YOLOv8, dense pedestrian detection, occlusion-aware attention, deformable convolution, dynamic decoupled head

CLC Number: