Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (5): 849-860.DOI: 10.11996/JG.j.2095-302X.2023050849

• Image Processing and Computer Vision • Previous Articles     Next Articles

Detection method of dropped anti-vibration hammer for transmission line based on improved Cascade RCNN

YAN Guang-wei(), LIU Run-ze, JIAO Run-hai(), HE Hui   

  1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102200, China
  • Received:2022-12-30 Accepted:2023-05-15 Online:2023-10-31 Published:2023-10-31
  • Contact: JIAO Run-hai (1977-), professor, Ph.D. His main research interests cover image recognition, machine learning and data mining. E-mail:runhaijiao@ncepu.edu.cn
  • About author:YAN Guang-wei (1971-), associate professor, Ph.D. His main research interests cover computer graphics, image and information security. E-mail:yan_guang_wei@126.com
  • Supported by:
    National Natural Science Foundation of China(62073133)

Abstract:

During the inspection of transmission lines using Unmanned Aerial Vehicle (UAV), there are many dropped anti-vibration hammers that become obstructed by wires or are shot from a distance. This challenge leads to the occlusion of target features and low resolution. In addition, the close proximity of a number of hammers due to sliding poses challenges to the accuracy of target identification. To address the above problems, a deep neural network based on an improved Cascade RCNN was proposed to identify the dropped anti-vibration hammers. The proposed network mainly achieved improvements from the following four aspects. First of all, a contrastive learning network was designed to compare the features of positive and negative samples with real samples. By utilizing a contrastive loss function during network training, the network became more attentive to the blocked dropped anti-vibration hammers and enhanced its feature extraction ability. Secondly, the classifier was enhanced. The selection of interested regions with better regression performance in the cascade structure was filtered and input directly into the final classification regression queue. This improved the classification performance of the classifier, thereby enhancing the classification scores of the detected targets. Thirdly, a parallel attention mechanism module was designed to integrate the extracted features from the network, increasing the weights of key features and directing the network’s attention to more critical features in the image. In addition, in the process of feature fusion of the feature pyramid, the bilinear interpolation method was replaced with deconvolution to enhance the feature restoration capability. The experimental results demonstrated that the improved model achieved a recall rate of 97.5%, precision of 91.0%, and average precision of 92.0%, an improvement of 6.9%, 28.4%, and 8.0%, respectively, compared with the baseline model.

Key words: transmission line, dropped anti-vibration hammer, Cascade RCNN, contrastive learning network, parallel attention module, classifier enhancement, sample similarity

CLC Number: