欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (2): 249-259.DOI: 10.11996/JG.j.2095-302X.2023020249

• 图像处理与计算机视觉 • 上一篇    下一篇

二阶段锚框和类均衡损失的遥感图像目标检测

曾伦杰1(), 储珺1,2(), 陈昭俊2   

  1. 1.南昌航空大学信息工程学院,江西 南昌 330063
    2.南昌航空大学软件学院,江西 南昌 330063
  • 收稿日期:2022-08-12 接受日期:2022-10-10 出版日期:2023-04-30 发布日期:2023-05-01
  • 通讯作者: 储珺(1967-),女,教授,博士。主要研究方向为复杂场景的目标检测和跟踪。E-mail:chuj@nchu.edu.cn
  • 作者简介:曾伦杰(1997-),男,硕士研究生。主要研究方向为深度学习与目标检测。E-mail:13576563600@163.com
  • 基金资助:
    国家自然科学基金项目(62162045);江西省重点研发计划项目(20192BBE50073)

Object detection in remote sensing image based on two-stage anchor and class balanced loss

ZENG Lun-jie1(), CHU Jun1,2(), CHEN Zhao-jun2   

  1. 1. School of Information Engineering, Nanchang Hangkong University, Nanchang Jiangxi 330063, China
    2. School of Software Engineering, Nanchang Hangkong University, Nanchang Jiangxi 330063, China
  • Received:2022-08-12 Accepted:2022-10-10 Online:2023-04-30 Published:2023-05-01
  • Contact: CHU Jun (1967-), professor, Ph.D. Her main research interests cover object detection and tracking in complex scenes. E-mail:chuj@nchu.edu.cn
  • About author:ZENG Lun-jie (1997-), master student. His main research interests cover deep learning and object detection. E-mail:13576563600@163.com
  • Supported by:
    National Natural Science Foundation of China(62162045);Jiangxi Province Key R&D Program Project(20192BBE50073)

摘要:

由于现有遥感图像数据集中不同类别目标的数量差异大,数据集中存在类别分布不平衡问题,影响网络模型对少数类别的检测精度。针对以上问题,提出了二阶段锚框和类均衡损失的遥感图像目标检测算法。通过K-means聚类生成遥感数据集的类平衡标签,再将得到的类平衡标签作为第二阶段K-means聚类的初始中心,生成的预设锚框能够兼顾少数类别尺度,提高少数类别实例的检测精度。同时构建类别平衡损失(CEQL),在平衡损失(EQL)的基础上,采用有效样本构建辅助权重,提高模型在训练过程中对少数类别的关注度。实验表明,改进后模型的平均准确率均值、少数类别平均准确率分别达到76.13%和76.51%,对比基准网络分别提高了1.56%和1.75%。在DOIR和NWPU VHR-10数据集上,与主流方法Faster-RCNN,RetinaNet,CenterNet,YOLOv4,YOLOX-L,YOLOv5及YOLOv7等进行了对比,实验表明改进后的算法能够在保证多数类别检测精度的基础上,有效提高了少数类别的检测精度。

关键词: 遥感检测, 类不平衡, 重加权, K-means, YOLOv4

Abstract:

Due to the large differences in the number of different categories of targets in the existing remote sensing image data collection, the distribution of categories in the dataset is unbalanced, affecting the detection accuracy of network models for a few categories. In light of the aforementioned challenges, a two-stage anchor frame and class-balanced loss target detection algorithm for remote sensing images was presented. The class balance labels of remote sensing datasets were generated by K-means clustering, subsequently utilized as the initial center for the second stage of K-means clustering. The resulting preset anchor frames were able to take into account a few class scales and improve the detection accuracy of a few class instances. At the same time, class equalization loss (CEQL) was constructed. Based on equalization loss (EQL), effective samples were used to construct auxiliary weights to improve the model′s attention to a few categories during training. The experimental results demonstrated that the improved model achieved an average accuracy of 76.13% and a few categories′ average accuracy of 76.51%, increasing by 1.56% and 1.75%, respectively, compared with the datum network. When evaluated on the DOIR and NWPU VHR-10 datasets, and compared with the main methods such as Faster-RCNN, RetinaNet, CenterNet, YOLOv4, YOLOX-L, YOLOv5, and YOLOv7, the experiment showed that the improved algorithm could effectively improve the detection accuracy of a few categories while maintaining the detection accuracy of most categories.

Key words: remote sensing detection, class imbalance, re-weighting, K-means, YOLOv4

中图分类号: