基于遥感图像的多模态小目标检测

doi:10.11996/JG.j.2095-302X.2022020197

图学学报 ›› 2022, Vol. 43 ›› Issue (2): 197-204.DOI: 10.11996/JG.j.2095-302X.2022020197

• 图像处理与计算机视觉 • 上一篇下一篇

基于遥感图像的多模态小目标检测

南京航空航天大学计算机科学与技术学院，江苏南京 210016

出版日期:2022-04-30 发布日期:2022-05-07
基金资助:
国家自然科学基金项目(62072235)

Multimodal small target detection based on remote sensing image

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing Jiangsu 210016, China

Online:2022-04-30 Published:2022-05-07
Supported by:
National Natural Science Foundation of China (62072235)

摘要/Abstract

摘要： 由于遥感图像目标往往较小且容易受光线、天气等因素的影响，所以单一模态下基于深度学习
的遥感图像目标检测的准确度较低。然而，不同模态间的图像信息可以相互增强提高目标检测的性能。因此，
基于 RGB 和红外图像，提出了一种适用于遥感图像多模态小目标检测的平衡多模态深度模型。相比简单地相
加、点乘和拼接的方式融合 2 个模态的特征信息，设计了一种平衡多模态特征的方法增强目标特征，以弥补单
一模态信息不足的缺点。首先分别对 RGB 和红外图像进行浅层特征提取；其次，融合 2 个模态的特征信息并
进行深层的特征提取；然后，基于 YOLOv4 方法，构建了多模态小目标检测模型。最后，基于 VEDAI 数据集，
在遥感图像多模态小目标检测实验结果中验证了该方法的有效性。

关键词: 遥感图像, 平衡多模态深度模型, 小目标检测, 融合, VEDAI 数据集

Abstract: Since targets in remote sensing images are relatively small and easily affected by illumination, weather, and
other factors, deep-learning based target detection methods from single modality remote sensing images suffer from
low accuracy. However, the image information between different modalities can enhance each other to improve the
performance of target detection. Therefore, based on RGB and infrared images fusion, we proposed a balanced
multimodal depth model (BMDM) for multimodal small target detection from remote sensing images. As opposed to
simple element-wise summation, element-wise multiplication, and concatenation to fuse the feature information of the
two modalities, we designed a balanced multimodal feature method to enhance target features to make up for the
shortcomings of single modal information. We first extracted low-level features from RGB and infrared images,
respectively. Secondly, we fused the feature information of the two modalities and extracted deep-level features.
Thirdly, we constructed a multimodal small target detection model based on the one-stage method. Finally, the
effectiveness of the proposed method was verified by the experimental results of multimodal small target detection
performed on the public dataset VEDAI of remote sensing images.

Key words: remote sensing images, balanced multimodal deep model, small target detection, fusion, VEDAI dataset

中图分类号:

TP 753

胡俊, 顾晶晶, 王秋红. 基于遥感图像的多模态小目标检测[J]. 图学学报, 2022, 43(2): 197-204.

HU Jun, GU Jing-jing, WANG Qiu-hong. Multimodal small target detection based on remote sensing image[J]. Journal of Graphics, 2022, 43(2): 197-204.

[1]	马彦博, 李琳, 陈缘, 赵洋, 胡锐. 基于时空融合的多帧压缩视频增强方法[J]. 图学学报, 2022, 43(4): 651-658.
[2]	王素琴, 任琪, 石敏, 朱登明. 基于异常检测的产品表面缺陷检测与分割[J]. 图学学报, 2022, 43(3): 377-386.
[3]	王文亮, 陈纯毅, 胡小娟, 于海洋, 田野. 融合阴影图和深度划分阴影体的阴影渲染算法[J]. 图学学报, 2022, 43(3): 478-485.
[4]	李扬科, 宋全博, 周元峰. 用于手势识别的时空融合网络以及虚拟签名系统[J]. 图学学报, 2022, 43(3): 504-512.
[5]	张运波, 易鹏飞, 周东生, 张强, 魏小鹏. 深度可分离卷积和标准卷积相结合的高效行人检测器[J]. 图学学报, 2022, 43(2): 230-238.
[6]	苏常保, 龚世才. 基于深度学习的人物肖像全自动抠图算法[J]. 图学学报, 2022, 43(2): 247-253.
[7]	刘玉杰, 张敏杰, 李宗民, 李华. 基于全局姿态感知的轻量级人体姿态估计[J]. 图学学报, 2022, 43(2): 333-341.
[8]	唐静, 彭伟龙, 唐可可, 方美娥. 基于多视图网络三维形状检索的通用扰动攻击[J]. 图学学报, 2022, 43(1): 93-100.
[9]	张成 , 侯宇超 , 焦宇倩 , 白艳萍 , 李建军 . 基于三通道分离特征融合与支持向量机的混凝土图像分类研究[J]. 图学学报, 2021, 42(6): 917-923.
[10]	林森 , 刘旭 . 门控融合对抗网络的水下图像增强 [J]. 图学学报, 2021, 42(6): 948-956.
[11]	汪丹丹, 张旭东, 范之国, 孙锐. 基于 RGB-D 的反向融合实例分割算法[J]. 图学学报, 2021, 42(5): 767-774.
[12]	蒋镕圻, 彭月平, 谢文宣, 谢郭蓉. 嵌入 scSE 模块的改进 YOLOv4 小目标检测算法[J]. 图学学报, 2021, 42(4): 546-555.
[13]	牟琦, 张寒, 何志强, 李占利 . 基于深度估计和特征融合的尺度自适应目标跟踪算法[J]. 图学学报, 2021, 42(4): 563-571.
[14]	张鹏飞 , 石志良 , 李晓垚 , 欧阳祥波 . 基于深度学习的主轴承盖分类识别算法[J]. 图学学报, 2021, 42(4): 572-580.
[15]	张繁, 尹鑫, 徐宇扬, 郝鹏翼 . 基于多尺度特征提取的多导联心跳信号分类[J]. 图学学报, 2021, 42(4): 581-589.

基于遥感图像的多模态小目标检测

Multimodal small target detection based on remote sensing image

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价