Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (2): 249-259.DOI: 10.11996/JG.j.2095-302X.2023020249

Previous Articles     Next Articles

Object detection in remote sensing image based on two-stage anchor and class balanced loss

ZENG Lun-jie1(), CHU Jun1,2(), CHEN Zhao-jun2   

  1. 1. School of Information Engineering, Nanchang Hangkong University, Nanchang Jiangxi 330063, China
    2. School of Software Engineering, Nanchang Hangkong University, Nanchang Jiangxi 330063, China
  • Received:2022-08-12 Accepted:2022-10-10 Online:2023-04-30 Published:2023-05-01
  • Contact: CHU Jun (1967-), professor, Ph.D. Her main research interests cover object detection and tracking in complex scenes. E-mail:chuj@nchu.edu.cn
  • About author:ZENG Lun-jie (1997-), master student. His main research interests cover deep learning and object detection. E-mail:13576563600@163.com
  • Supported by:
    National Natural Science Foundation of China(62162045);Jiangxi Province Key R&D Program Project(20192BBE50073)

Abstract:

Due to the large differences in the number of different categories of targets in the existing remote sensing image data collection, the distribution of categories in the dataset is unbalanced, affecting the detection accuracy of network models for a few categories. In light of the aforementioned challenges, a two-stage anchor frame and class-balanced loss target detection algorithm for remote sensing images was presented. The class balance labels of remote sensing datasets were generated by K-means clustering, subsequently utilized as the initial center for the second stage of K-means clustering. The resulting preset anchor frames were able to take into account a few class scales and improve the detection accuracy of a few class instances. At the same time, class equalization loss (CEQL) was constructed. Based on equalization loss (EQL), effective samples were used to construct auxiliary weights to improve the model′s attention to a few categories during training. The experimental results demonstrated that the improved model achieved an average accuracy of 76.13% and a few categories′ average accuracy of 76.51%, increasing by 1.56% and 1.75%, respectively, compared with the datum network. When evaluated on the DOIR and NWPU VHR-10 datasets, and compared with the main methods such as Faster-RCNN, RetinaNet, CenterNet, YOLOv4, YOLOX-L, YOLOv5, and YOLOv7, the experiment showed that the improved algorithm could effectively improve the detection accuracy of a few categories while maintaining the detection accuracy of most categories.

Key words: remote sensing detection, class imbalance, re-weighting, K-means, YOLOv4

CLC Number: