
Journal of Graphics ›› 2021, Vol. 42 ›› Issue (5): 767-774. DOI: 10.11996/JG.j.2095-302X.2021050767

• Image Processing and Computer Vision •


A reverse fusion instance segmentation algorithm based on RGB-D  

  1. School of Computer and Information, Hefei University of Technology, Hefei, Anhui 230009, China
  • Online:2021-10-31 Published:2021-11-03
  • Supported by:
    National Natural Science Foundation of China (61876057, 61971177) 


Abstract: RGB-D images add Depth information to the RGB information of a scene and can therefore effectively describe both the color and the three-dimensional geometric information of the scene. Combining the characteristics of RGB and Depth images, this paper proposed a reverse fusion instance segmentation algorithm that fuses high-level semantic features back into low-level edge-detail features. The method extracted RGB and Depth image features separately with feature pyramid networks (FPN) of different depths, upsampled the high-level features to the same size as the lowest-level features, and then applied reverse fusion to merge the high-level features into the low-level ones; a mask refinement structure was also introduced into the mask branch, yielding RGB-D reverse fusion instance segmentation. The experimental results show that the reverse fusion feature model achieves better results on RGB-D instance segmentation and effectively fuses the two different kinds of features from the Depth and color images. With ResNet-101 as the backbone network, the average precision (AP) was 10.6% higher than that of Mask R-CNN without depth information and 4.5% higher than that of directly forward-fusing the two kinds of features.
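
To make the data flow described in the abstract concrete, the following is a minimal sketch, in PyTorch-style Python, of the reverse-fusion step: higher-level, more semantic pyramid maps are upsampled to the resolution of the lowest-level (edge-detail) map and merged into it. This is not the authors' implementation; the module name ReverseFusion, the 256-channel maps, the 1x1 alignment convolutions, element-wise addition as the fusion operator, and the way the RGB and Depth branches are combined at the end are all illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F


class ReverseFusion(nn.Module):
    """Fuse higher-level (semantic) pyramid features back into the finest level."""

    def __init__(self, channels: int = 256, num_levels: int = 4):
        super().__init__()
        # 1x1 convolutions that align each level before fusion (assumed design choice).
        self.align = nn.ModuleList(nn.Conv2d(channels, channels, 1) for _ in range(num_levels))
        # A 3x3 convolution to smooth the fused map (also an assumption).
        self.smooth = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, pyramid):
        # `pyramid` is ordered fine -> coarse, e.g. [P2, P3, P4, P5] from an FPN.
        target = pyramid[0]
        fused = self.align[0](target)
        for lvl, feat in enumerate(pyramid[1:], start=1):
            # Upsample each coarser, more semantic level to the size of the finest
            # level, then add it in, pushing high-level semantics down to the
            # low-level edge-detail map ("reverse" fusion).
            up = F.interpolate(feat, size=target.shape[-2:], mode="bilinear",
                               align_corners=False)
            fused = fused + self.align[lvl](up)
        return self.smooth(fused)


if __name__ == "__main__":
    # Toy pyramids standing in for the outputs of the RGB and Depth FPN branches.
    sizes = [(200, 272), (100, 136), (50, 68), (25, 34)]
    rgb_pyramid = [torch.randn(1, 256, h, w) for h, w in sizes]
    depth_pyramid = [torch.randn(1, 256, h, w) for h, w in sizes]

    fuser = ReverseFusion(channels=256, num_levels=4)
    # One simple way to combine the two modalities is to reverse-fuse each branch
    # and sum the results; the paper's exact cross-modal fusion may differ.
    out = fuser(rgb_pyramid) + fuser(depth_pyramid)
    print(out.shape)  # torch.Size([1, 256, 200, 272])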

Key words: Depth images, instance segmentation, feature fusion, reverse fusion, mask refinement

CLC number: