欢迎访问《图学学报》 分享到:

图学学报 ›› 2025, Vol. 46 ›› Issue (6): 1327-1336.DOI: 10.11996/JG.j.2095-302X.2025061327

• 图像处理与计算机视觉 • 上一篇    下一篇

基于YOLOv8-OSRA的钢拱塔表观病害多目标检测方法

王海涵()   

  1. 中国铁路广州局集团有限公司广州工程建设指挥部广东 广州 510180
  • 收稿日期:2024-10-25 接受日期:2025-03-16 出版日期:2025-12-30 发布日期:2025-12-27
  • 第一作者:王海涵(1973-),男,高级工程师,硕士。主要研究方向为结构健康监测、图像处理、计算机视觉等。E-mail:wanghaihan@126.com

Multi object detection method for surface defects of steel arch towers based on YOLOv8-OSRA

WANG Haihan()   

  1. Guangzhou Engineering Construction Command of China Railway Guangzhou Group Company Limited, Guangzhou Guangdong 510180, China
  • Received:2024-10-25 Accepted:2025-03-16 Published:2025-12-30 Online:2025-12-27
  • First author:WANG Haihan (1973-), senior engineer, master. His main research interests cover structural health monitoring, image processing, computer vision, etc. E-mail:wanghaihan@126.com

摘要: 钢拱塔是钢拱斜拉桥的主要承重结构,其表观病害(如腐蚀、剥落和裂缝等)的早期检测与评估对保障桥梁结构安全至关重要。针对传统人工检测方法效率低下、主观性强且难以覆盖高空隐蔽区域的问题。提出了一种基于改进YOLOv8n-Seg深度学习框架与OSRA注意力机制的智能检测方法。利用自主研发的轮轨式检测机器人系统采集钢拱塔内部高分辨率图像数据,结合开源数据集构建了包含5 846张原始图像的多病害数据集,并通过随机裁剪、镜像翻转和亮度调整等数据增强技术将样本量扩展至23 378张图像。在算法设计层面,创新性地将OSRA注意力模块嵌入YOLOv8n-Seg网络的特征融合层,通过重叠分块策略和局部细化机制显著提升了模型对不规则边界和微小病害特征的捕捉能力。结果表明:优化后的YOLOv8-OSRA模型在独立测试集上取得了显著的性能提升,锈蚀检测mAP@0.5达到90.9% (提升2.6%),裂缝识别精度达到87.0% (提升1.1%),剥落检测准确率达到81.9% (提升2.1%)。消融实验进一步验证了OSRA模块在保持计算效率(仅增加0.8% GFLOPs)的同时,其性能优势显著优于SE和CBAM等注意力机制。研究成果为钢拱塔病害检测提供可部署于移动检测设备的轻量化解决方案,且提出的多尺度特征增强方法为复杂钢结构表面缺陷检测提供参考。

关键词: 桥梁工程, 轮轨式检测机器人, 表观病害检测, 机器视觉, 深度学习, YOLOv8

Abstract:

Steel arch towers, as the primary load-bearing structures of steel arch cable-stayed bridges, require early detection and assessment of their surface defects—such as corrosion, spalling, and cracks—to ensure structural safety. However, traditional manual inspection methods are inefficient, highly subjective, and unable to access concealed areas at high altitudes. To address these challenges, an intelligent detection method based on an improved YOLOv8n-Seg deep-learning framework integrated with the OSRA attention mechanism was proposed. High-resolution internal images of steel arch towers were collected using a self-developed rail-guided inspection robot system. A comprehensive dataset containing 5 846 original images was constructed from both collected and open-source data, and data augmentation techniques—including random cropping, mirroring, and brightness adjustment—expanded the dataset to 23 378 images. At the algorithmic level, the OSRA attention module was innovatively embedded into the feature fusion layer of the YOLOv8n-Seg network. By leveraging an overlapping patching strategy and a local refinement mechanism, the model’s ability to capture irregular boundaries and small-scale defect features was significantly enhanced. Experimental results demonstrated that the optimized YOLOv8-OSRA model achieved notable performance improvements on an independent test set: corrosion detection mAP@0.5 reached 90.9% (+2.6%), crack identification accuracy reached 87.0% (+1.1%), and spalling detection accuracy reached 81.9% (+2.1%). Ablation experiments further confirmed that the OSRA module, while maintaining computational efficiency (increasing GFLOPs by only 0.8%), outperformed conventional attention mechanisms such as SE and CBAM. The findings provided a lightweight and deployable solution for steel arch tower defect detection, and the proposed multi-scale feature enhancement approach offered valuable insights for detecting surface defects in complex steel structures.

Key words: bridge engineering, wheel rail inspection robot, surface defect detection, computer vision, deep learning, YOLOv8

中图分类号: