欢迎访问《图学学报》 分享到:

图学学报 ›› 2022, Vol. 43 ›› Issue (2): 197-204.DOI: 10.11996/JG.j.2095-302X.2022020197

• 图像处理与计算机视觉 • 上一篇    下一篇

基于遥感图像的多模态小目标检测

  

  1. 南京航空航天大学计算机科学与技术学院,江苏 南京 210016
  • 出版日期:2022-04-30 发布日期:2022-05-07
  • 基金资助:
    国家自然科学基金项目(62072235)

Multimodal small target detection based on remote sensing image

  1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing Jiangsu 210016, China
  • Online:2022-04-30 Published:2022-05-07
  • Supported by:
    National Natural Science Foundation of China (62072235)

摘要: 由于遥感图像目标往往较小且容易受光线、天气等因素的影响,所以单一模态下基于深度学习
的遥感图像目标检测的准确度较低。然而,不同模态间的图像信息可以相互增强提高目标检测的性能。因此,
基于 RGB 和红外图像,提出了一种适用于遥感图像多模态小目标检测的平衡多模态深度模型。相比简单地相
加、点乘和拼接的方式融合 2 个模态的特征信息,设计了一种平衡多模态特征的方法增强目标特征,以弥补单
一模态信息不足的缺点。首先分别对 RGB 和红外图像进行浅层特征提取;其次,融合 2 个模态的特征信息并
进行深层的特征提取;然后,基于 YOLOv4 方法,构建了多模态小目标检测模型。最后,基于 VEDAI 数据集,
在遥感图像多模态小目标检测实验结果中验证了该方法的有效性。

关键词: 遥感图像, 平衡多模态深度模型, 小目标检测, 融合, VEDAI 数据集

Abstract: Since targets in remote sensing images are relatively small and easily affected by illumination, weather, and
other factors, deep-learning based target detection methods from single modality remote sensing images suffer from
low accuracy. However, the image information between different modalities can enhance each other to improve the
performance of target detection. Therefore, based on RGB and infrared images fusion, we proposed a balanced
multimodal depth model (BMDM) for multimodal small target detection from remote sensing images. As opposed to
simple element-wise summation, element-wise multiplication, and concatenation to fuse the feature information of the
two modalities, we designed a balanced multimodal feature method to enhance target features to make up for the
shortcomings of single modal information. We first extracted low-level features from RGB and infrared images,
respectively. Secondly, we fused the feature information of the two modalities and extracted deep-level features.
Thirdly, we constructed a multimodal small target detection model based on the one-stage method. Finally, the
effectiveness of the proposed method was verified by the experimental results of multimodal small target detection
performed on the public dataset VEDAI of remote sensing images.

Key words:  remote sensing images, balanced multimodal deep model, small target detection, fusion, VEDAI dataset

中图分类号: