Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (4): 770-778.DOI: 10.11996/JG.j.2095-302X.2024040770

• Image Processing and Computer Vision • Previous Articles     Next Articles

Research on multi-scale road damage detection algorithm based on attention mechanism

WU Bing(), TIAN Ying()   

  1. School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan Liaoning 114051, China
  • Received:2024-04-26 Accepted:2024-06-28 Online:2024-08-31 Published:2024-09-03
  • Contact: TIAN Ying
  • About author:First author contact:

    WU Bing (1999), MS candidate, his research interests include computer vision, deep learning. E-mail:2258860606@qq.com

  • Supported by:
    National Natural Science Foundation of China(62072086);Funded by Liaoning Provincial Department of Education(LJKM20220646)

Abstract:

Road damage detection is an important task in road maintenance and repair. The existing road damage detection methods primarily rely on traditional manual detection, which requires significant manpower and material resources, resulting in low detection efficiency and an inability to meet the needs of current road development.To address these problems, an improved multi-scale road damage detection algorithm, YOLOv8-RDD, was proposed. Firstly, the YOLOv8-RDD algorithm employed Deformable Convolutional Networks (DCN) in the C2f module to build a new C2f_DCN module. This expanded the effective range of the receptive field and located the boundary and position of target objects more accurately, thus enhancing the ability to identify and locate the target. At the end of backbone network, a new SPPF_GS module was designed, introducing the Self-Attention (SA) mechanism and the Phantom Convolution Ghost module into the SPPF module, with the size of pooled kernel re-optimized to better deal with long-distance dependence and capture global information. Finally, Coordinate Attention (CA) was introduced into the Neck to strengthen the feature extraction ability of the model and reduce redundant information. Experimental results demonstrated that the improved algorithm achieved a Precision of 61.1%, a Recall rate of 55.5%, and a mean average precision (mAP) of 56.2% on the RDD2022 dataset. Compared with the YOLOv8n algorithm, the results were improved by 4.6%, 4.7%, and 5.2%, respectively, which achieved excellent performance in the target detection of road damage.

Key words: road damage detection, YOLOv8, deformable convolutional networks, attention mechanism, Ghost module

CLC Number: