Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (5): 901-912.DOI: 10.11996/JG.j.2095-302X.2024050901

• Image Processing and Computer Vision • Previous Articles     Next Articles

TFD-YOLOv8: a transmission line foreign object detection method

WANG Yaru(), FENG Lilong, SONG Xiaoke, QU Zhuo, YANG Ke, WANG Qianming(), ZHAI Yongjie   

  1. Department of Automation, North China Electric Power University, Baoding Hebei 071003, China
  • Received:2024-05-08 Revised:2024-06-25 Online:2024-10-31 Published:2024-10-31
  • Contact: WANG Qianming
  • About author:First author contact:

    WANG Yaru (1990-), lecturer, Ph.D. Her main research interests cover detection of transmission line components, etc. E-mail:wangyaru@ncepu.edu.cn

  • Supported by:
    Youth Fund of the National Natural Science Foundation of China Project(62303184);National Natural Science Foundation of China Joint Fund Project Key Support Project(U21A20486);The National Nature Science Foundation of China(62373151);Fundamental Research Funds for the Central Universities(2023JC006);Fundamental Research Funds for the Central Universities(2024MS136)

Abstract:

Foreign object detection based on UAV aerial images is an important aspect for intelligent inspection of transmission lines. The YOLO target detection algorithm is high in accuracy and speed, making it the current mainstream algorithm. However, when carrying out transmission line foreign object detection, due to the variable scale and insignificant features of foreign object targets, problems such as misdetection and omission detection would arise. A YOLOv8 model (transmission line foreign detection-YOLOv8, TFD-YOLOv8) was proposed for transmission line foreign object detection. A two-branch downsampling module was constructed in the YOLOv8s neck network to intercept the scale-related detail information easily lost during the downsampling process, achieving the efficient fusion of semantic and detail information and improving the information consistency of feature maps at different scales. Then, a mix-enhancement attention module was inserted into the backbone network to simultaneously extract global and local features of the image, generating spatial attention and channel attention, respectively, and resulting in a mix-enhancement attention including local, global, spatial, and channel information. This enhanced the network’s ability to capture the key features of the targe. The experimental results showed that compared with the baseline model, the proposed method improved the average detection accuracy by 6.7%, and the accuracy and recall by 12.9% and 5.1%, respectively. This method demonstrated advantages in terms of detection accuracy and complexity compared with several existing target detection methods.

Key words: YOLOv8, transmission line, object detection, downsampling, mix-enhancement

CLC Number: