欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (2): 233-240.DOI: 10.11996/JG.j.2095-302X.2023020233

• 图像处理与计算机视觉 • 上一篇    下一篇

基于P-CenterNet的光学遥感图像烟囱检测

谢国波(), 贺笛轩, 何宇钦, 林志毅()   

  1. 广东工业大学计算机学院,广东 广州 510006
  • 收稿日期:2022-06-05 接受日期:2022-08-10 出版日期:2023-04-30 发布日期:2023-05-01
  • 通讯作者: 林志毅(1979-),男,讲师,博士。主要研究方向为人工智能、生物信息学等。E-mail:lzy291@gdut.edu.cn
  • 作者简介:谢国波(1977-),男,教授,博士。主要研究方向为计算智能及其在遥感影像处理应用、高光谱遥感、复杂疾病模式挖掘等。E-mail:xiegb@gdut.edu.cn
  • 基金资助:
    国家自然科学基金项目(61802072)

P-CenterNet for chimney detection in optical remote-sensing images

XIE Guo-bo(), HE Di-xuan, HE Yu-qin, LIN Zhi-yi()   

  1. School of Computer Science, Guangdong University of Technology, Guangzhou Guangdong 510006, China
  • Received:2022-06-05 Accepted:2022-08-10 Online:2023-04-30 Published:2023-05-01
  • Contact: LIN Zhi-yi (1979-), lecturer, Ph.D. His main research interests cover artificial intelligence, bioinformatics, etc. E-mail:lzy291@gdut.edu.cn
  • About author:XIE Guo-bo (1977-), professor, Ph.D. His main research interests cover computational intelligence and its application to remote sensing image processing, hyperspectral remote sensing, complex disease pattern mining, etc. E-mail:xiegb@gdut.edu.cn
  • Supported by:
    National Natural Science Foundation of China(61802072)

摘要:

工业烟囱排放是城市空气污染的主要因素之一,城市环境质量与烟囱数量成反比。因此,烟囱位置检测对城市环境检测和治理具有积极的影响。在烟囱检测任务中,针对光学遥感图像背景复杂、目标小,存在大量相似对象导致的检测精度低的问题,提出了一种基于CenterNet的检测器P-CenterNet。首先,为了获得更丰富的语义特征,P-CenterNet使用了金字塔卷积取代骨干网络中的普通卷积;其次,并行于骨干网络设计了一个多尺度上下文特征提取模块来保留有助于从背景区域中区分对象区域的低级特征信息;最后,增加了一个卷积块注意力模块进一步提取骨干网络的输出特征,提高检测器对小目标的表达能力。实验使用了DIOR这个大规模的公开数据集来验证模型的有效性,采用线上、线下2种增强手段对数据集进行扩充,增强模型的鲁棒性。结果表明,与Faster-RCNN和YOLOv3这类模型相比,P-CenterNet在检测时间成本相近的情况下,明显提高了检测精度,mAP达到了89.77%。

关键词: 烟囱检测, 光学遥感图像, CenterNet, 金字塔卷积, 注意力机制

Abstract:

Industrial chimney emissions are among the primary drivers of urban air pollution, with the urban environment quality inversely related to the quantity of chimneys therein. Therefore, the detection of chimney placement exerts a positive impact on urban environmental detection and governance. To address the problem of low detection accuracy caused by complex backgrounds and small targets in optical remote sensing images with numerous similar objects, P-CenterNet, a CenterNet-based detector, has been proposed for chimney detection tasks. Firstly, P-CenterNet employed pyramidal convolution in the backbone network to obtain richer linguistic features, instead of normal convolution in the backbone network. Secondly, a multi-scale contextual feature extraction module was designed in parallel with the backbone network to retain low-level feature information that helped distinguish object regions from background regions. Finally, a convolutional block attention module was added to further extract the output features of the backbone network to improve the detector′s representation of small targets. Furthermore, DIOR, a large-scale public dataset, was applied for the validation of the model in the experiments. The dataset was expanded and the robustness of the model was enhanced via both online and offline enhancements. The results indicated that P-CentreNet could significantly improve detection accuracy with a similar detection time cost, compared with other models such as Faster-RCNN and YOLOv3, with mAP reaching 89.77%.

Key words: chimney detection, optical remote sensing images, CenterNet, pyramidal convolution, attention mechanism

中图分类号: