Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (6): 1327-1336.DOI: 10.11996/JG.j.2095-302X.2025061327

• Image Processing and Computer Vision • Previous Articles     Next Articles

Multi object detection method for surface defects of steel arch towers based on YOLOv8-OSRA

WANG Haihan()   

  1. Guangzhou Engineering Construction Command of China Railway Guangzhou Group Company Limited, Guangzhou Guangdong 510180, China
  • Received:2024-10-25 Accepted:2025-03-16 Online:2025-12-30 Published:2025-12-27
  • About author:First author contact:

    WANG Haihan (1973-), senior engineer, master. His main research interests cover structural health monitoring, image processing, computer vision, etc. E-mail:wanghaihan@126.com

Abstract:

Steel arch towers, as the primary load-bearing structures of steel arch cable-stayed bridges, require early detection and assessment of their surface defects—such as corrosion, spalling, and cracks—to ensure structural safety. However, traditional manual inspection methods are inefficient, highly subjective, and unable to access concealed areas at high altitudes. To address these challenges, an intelligent detection method based on an improved YOLOv8n-Seg deep-learning framework integrated with the OSRA attention mechanism was proposed. High-resolution internal images of steel arch towers were collected using a self-developed rail-guided inspection robot system. A comprehensive dataset containing 5 846 original images was constructed from both collected and open-source data, and data augmentation techniques—including random cropping, mirroring, and brightness adjustment—expanded the dataset to 23 378 images. At the algorithmic level, the OSRA attention module was innovatively embedded into the feature fusion layer of the YOLOv8n-Seg network. By leveraging an overlapping patching strategy and a local refinement mechanism, the model’s ability to capture irregular boundaries and small-scale defect features was significantly enhanced. Experimental results demonstrated that the optimized YOLOv8-OSRA model achieved notable performance improvements on an independent test set: corrosion detection mAP@0.5 reached 90.9% (+2.6%), crack identification accuracy reached 87.0% (+1.1%), and spalling detection accuracy reached 81.9% (+2.1%). Ablation experiments further confirmed that the OSRA module, while maintaining computational efficiency (increasing GFLOPs by only 0.8%), outperformed conventional attention mechanisms such as SE and CBAM. The findings provided a lightweight and deployable solution for steel arch tower defect detection, and the proposed multi-scale feature enhancement approach offered valuable insights for detecting surface defects in complex steel structures.

Key words: bridge engineering, wheel rail inspection robot, surface defect detection, computer vision, deep learning, YOLOv8

CLC Number: