Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (4): 691-698.DOI: 10.11996/JG.j.2095-302X.2023040691

Previous Articles     Next Articles

Text detection method for electrical equipment nameplates based on deep learning

WANG Dao-lei(), KANG Bo, ZHU Rui()   

  1. College of Energy and Mechanical Engineering, Shanghai University of Electric Power, Shanghai 200090, China
  • Received:2022-11-08 Accepted:2023-01-12 Online:2023-08-31 Published:2023-08-16
  • Contact: ZHU Rui (1981-), associate professor. Ph.D. Her main research interests cover insulator detection, transformation equipment fault detection and deep learning. E-mail:zhuruish@163.com
  • About author:

    WANG Dao-lei (1981-), professor, Ph.D. His main research interests cover computer vision, image processing and CAD/CAM. E-mail:alfredwdl@shiep.edu.cn

  • Supported by:
    National Natural Science Foundation of China(61502297)

Abstract:

The prompt detection of power equipment nameplates can help the complete transformer substations and power plants to efficiently comprehend device information and perform necessary maintenance, thus ensuring the proper functioning. This thesis addressed the problem of enhancing text detection efficiency while also taking into account the improvement of precision. To that end, we introduced the concept of convolutional block attention module (CBAM) into the DBNet, and improved the detection head. Multi-scale feature feature pyramid networks (FPN) structures were introduced into the backbone network, improving upon the original FPN. Meanwhile, in view of the absence of public data for power equipment nameplates and difficulties in obtaining it, we proposed a technique to enhance the data by cutting nameplate images into rectangles and then splicing them together into a new image. In this way, the data set could be effectively expanded. The experimental results showed that both the data enhancement method and the improved DBNet network structure proposed in this paper have played a role in improving the detection performance, surpassing most current text detection network structures on the market. The improved DBNet network structure combined with data enhancement method yielded a precision rate of 90.3% and a recalling rate of 79.7%. The rate of F-measure also increased to nearly 84.7%, a 3.3% improvement over the original model, indicating that the detection performance was greatly improved while the loss of detecting speed changes remained minimal.

Key words: text detection, DBNet, CBAM, data enhancement, electrical equipment nameplates

CLC Number: