融合注意力机制的肠道息肉分割多尺度卷积神经网络

doi:10.11996/JG.j.2095-302X.2023010050

图学学报 ›› 2023, Vol. 44 ›› Issue (1): 50-58.DOI: 10.11996/JG.j.2095-302X.2023010050

• 图像处理与计算机视觉 • 上一篇下一篇

融合注意力机制的肠道息肉分割多尺度卷积神经网络

单芳湄¹(), 王梦文¹, 李敏¹^,²()

1.南京理工大学计算机科学与工程学院，江苏南京 210094
2.山东省数字医学与计算机辅助手术重点实验室，山东青岛 266003

收稿日期:2022-03-24 修回日期:2022-08-05 出版日期:2023-10-31 发布日期:2023-02-16
通讯作者: 李敏
作者简介:单芳湄(1997-)，女，硕士研究生。主要研究方向为图像分割。E-mail：1094264762@qq.com
基金资助:
国家自然科学基金项目(61501241);江苏省自然科学基金项目(BK20150792);山东省数字医学与计算机辅助手术重点实验室开放基金项目(SDKL-DMCAS-2018-04);江苏省交通运输科技项目(2021Y)

Multi-scale convolutional neural network incorporating attention mechanism for intestinal polyp segmentation

SHAN Fang-mei¹(), WANG Meng-wen¹, LI Min¹^,²()

1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing Jiangsu 210094, China
2. Shandong Key Laboratory of Digital Medicine and Computer Assisted Surgery, Qingdao Shandong 266003, China

Received:2022-03-24 Revised:2022-08-05 Online:2023-10-31 Published:2023-02-16
Contact: LI Min
About author:SHAN Fang-mei (1997-), master student. Her main research interest covers image segmentation. E-mail：1094264762@qq.com
Supported by:
National Natural Science Foundation of China(61501241);National Natural Science Foundation of Jiangsu Province(BK20150792);Foundation of Shandong Provincial Key Laboratory of Digital Medicine and Computer Assisted Surgery(SDKL-DMCAS-2018-04);Transport Science and Technology Project of Jiangsu Province(2021Y)

摘要/Abstract

摘要：

肠道息肉分割能够提供息肉在结肠中的位置和形态信息，方便医生依据其结构变化程度来推断癌变可能性，有利于结肠癌的早期诊断和治疗。针对许多现有的卷积神经网络所提取的多尺度特征有限，且常引入冗余和干扰特征，难以应对复杂多变的肠道息肉分割问题，提出了一种融合注意力机制的肠道息肉分割多尺度卷积神经网络(CNN)。首先，设计不同比例金字塔池化策略提取丰富的多尺度上下文信息；然后，通过在网络中融入通道注意力机制，模型能够根据目标自适应地选择合适的局部上下文信息和全局上下文信息进行特征集成；最后，联合金字塔池化策略和通道注意力机制构建多尺度有效语义融合解码网络，增强模型对形状、大小复杂多变的肠道息肉分割的鲁棒性。实验结果表明，本文模型分割的Dice系数、IoU和灵敏度在CVC-ClinicDB数据集上分别为90.6%，84.4%和91.1%，在ETIS-Larib数据集上分别为80.6%，72.6%和79.0%，其能够从肠镜图像中准确、有效地分割出肠道息肉。

关键词: 息肉分割, 肠镜图像, 卷积神经网络, 多尺度语义信息

Abstract:

Intestinal polyp segmentation provides the location and morphology of polyps in colon, allowing doctors to infer the possibility of canceration according to the degree of structural deformation, which facilitates the early diagnosis and treatment of colon cancer. In view of the limited multi-scale features extracted by many existing convolutional neural networks (CNN), and the frequently caused redundant and interfering features, it is difficult to extract the complex and variable targets. To address this challenge, a multi-scale convolutional neural network incorporating attention mechanism was proposed for intestinal polyp segmentation. Specifically, the pyramid strategy based on different scales of pooling was designed to capture the rich multi-scale context information. Then a channel attention mechanism was incorporated into the network so that the model could adaptively select appropriate local and global contextual information for feature integration based on the region of interest. Following that, by combining the pyramid pooling strategy and the channel attention mechanism, a multi-scale effective semantic fusion decoder network was constructed to improve the model robustness for segmentation of intestinal polyps with complex and variable shapes and sizes. The experimental results show that the Dice coefficient, IoU, and sensitivity produced by the proposed model reach 90.6%, 84.4%, and 91.1% on the CVC-ClinicDB dataset, and 80.6%, 72.6%, and 79.0% on the ETIS-Larib dataset, indicating that the proposed model could accurately and effectively segments polyps in colonoscopy images.

Key words: polyp segmentation, colonoscopy images, convolutional neural network, multiscale semantic information

中图分类号:

TP391

单芳湄, 王梦文, 李敏. 融合注意力机制的肠道息肉分割多尺度卷积神经网络[J]. 图学学报, 2023, 44(1): 50-58.

SHAN Fang-mei, WANG Meng-wen, LI Min. Multi-scale convolutional neural network incorporating attention mechanism for intestinal polyp segmentation[J]. Journal of Graphics, 2023, 44(1): 50-58.

图/表 9

图1 融合注意力机制的多尺度卷积神经网络整体架构图

Fig. 1 Overall framework of the multi-scale convolutional neural network incorporating attention mechanism

图2 多尺度有效语义融合模块

Fig. 2 Multi-scale effective semantic fusion module

图3 压缩-激励块

Fig. 3 Squeeze-and-excitation block

图4 不同方法在CVC-ClinicDB数据集上的分割结果((a)典型示例1；(b)典型示例2；(c)典型示例3)

Fig. 4 Segmentation results of different methods on the CVC-ClinicDB dataset ((a) Classic example 1; (b) Classic example 2; (c) Classic example 3)

表1 不同方法在CVC-ClinicDB数据集上的分割性能比较(%)

Table 1 Comparison of segmentation performance of different methods on the CVC-ClinicDB dataset (%)

方法	Dice	IoU	Sensitivity	Precision
FCN	80.9	75.3	79.4	81.4
BiONet	84.8	77.8	85.0	89.5
Attention UNet	87.2	80.2	86.9	91.5
UNet++	87.4	77.3	82.2	92.1
UNet	88.1	83.5	89.3	95.2
MultiResUNet	88.5	82.7	85.1	96.3
Ours	90.6	84.4	91.1	92.1

图5 不同方法在ETIS-Larib数据集上的分割结果((a)典型示例1；(b)典型示例2；(c)典型示例3)

Fig. 5 Segmentation results of different methods on the ETIS-Larib dataset ((a) Classic example 1; (b) Classic example 2; (c) Classic example 3)

表2 不同方法在ETIS-Larib数据集上的分割性能比较(%)

Table 2 Comparison of segmentation performance of different methods on the ETIS-Larib dataset (%)

方法	Dice	IoU	Sensitivity	Precision
FCN	59.7	53.9	61.3	68.2
UNet++	65.3	54.8	56.6	79.6
Attention UNet	69.4	61.6	66.9	88.2
UNet	73.3	66.9	71.2	82.1
Dil. ResFCN	75.5	71.1	71.9	87.7
Double UNet	76.2	72.1	73.3	83.9
Ours	80.6	72.6	79.0	88.0

表3 在CVC-ClinicDB数据集上的消融实验

Table 3 Ablation experiments on the CVC-ClinicDB dataset

方法	Dice (%)	IoU (%)	Sensitivity (%)	Precision (%)	FLOPs (M)	Params (M)
基准网络	88.1	83.5	89.3	95.2	57.9	28.9
基准网络+金字塔池化	89.3	82.6	90.7	90.8	60.6	30.3
基准网络+多尺度有效语义融合	90.6	84.4	91.1	92.1	64.1	32.1

表4 在ETIS-Larib数据集上的消融实验(%)

Table 4 Ablation experiments on the ETIS-Larib dataset (%)

方法	Dice	IoU	Sensitivity	Precision
基准网络	73.3	66.9	71.2	82.1
基准网络+ 金字塔池化	78.6	70.4	78.1	84.9
基准网络+多尺度有效语义融合	80.6	72.6	79.0	88.0

参考文献 14

[1]	MAHMUD T, PAUL B, FATTAH S A. PolypSegNet: a modified encoder-decoder architecture for automated polyp segmentation from colonoscopy images[J]. Computers in Biology and Medicine, 2021, 128: 104119. DOI URL
[2]	JHA D, SMEDSRUD P H, JOHANSEN D, et al. A comprehensive study on colorectal polyp segmentation with ResUNet++, conditional random field and test-time augmentation[J]. IEEE Journal of Biomedical and Health Informatics, 2021, 25(6): 2029-2040. DOI PMID
[3]	LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]//2015 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2015: 3431-3440.
[4]	RONNEBERGER O, FISCHER P, BROX T. U-net: convolutional networks for biomedical image segmentation[M]// Lecture Notes in Computer Science. Cham: Springer International Publishing, 2015: 234-241.
[5]	IBTEHAZ N, RAHMAN M S. MultiResUNet: rethinking the U-Net architecture for multimodal biomedical image segmentation[J]. Neural Networks, 2020, 121: 74-87. DOI URL
[6]	GUO Y B, BERNAL J, MATUSZEWSKI B J. Polyp segmentation with fully convolutional deep neural networks-extended evaluation study[J]. Journal of Imaging, 2020, 6(7): 69. DOI URL
[7]	XIANG T G, ZHANG C Y, LIU D N, et al. BiO-net: learning recurrent Bi-directional connections for encoder-decoder architecture[M]//Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. Cham: Springer International Publishing, 2020: 74-84.
[8]	SCHLEMPER J, OKTAY O, SCHAAP M, et al. Attention gated networks: learning to leverage salient regions in medical images[J]. Medical Image Analysis, 2019, 53: 197-207. DOI PMID
[9]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2018: 7132-7141.
[10]	FAN D P, JI G P, ZHOU T, et al. PraNet: parallel reverse attention network for polyp segmentation[M]//Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. Cham: Springer International Publishing, 2020: 263-273.
[11]	BERNAL J, SÁNCHEZ F J, FERNÁNDEZ-ESPARRACH G, et al. WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians[J]. Computerized Medical Imaging and Graphics, 2015, 43: 99-111. DOI URL
[12]	SILVA J, HISTACE A, ROMAIN O, et al. Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer[J]. International Journal of Computer Assisted Radiology and Surgery, 2014, 9(2): 283-293. DOI PMID
[13]	ZHOU Z W, SIDDIQUEE M M R, TAJBAKHSH N, et al. UNet++: redesigning skip connections to exploit multiscale features in image segmentation[J]. IEEE Transactions on Medical Imaging, 2020, 39(6): 1856-1867. DOI PMID
[14]	JHA D, RIEGLER M A, JOHANSEN D, et al. DoubleU-net: a deep convolutional neural network for medical image segmentation[C]// 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems. New York: IEEE Press, 2020: 558-564.

[1]	毕春艳, 刘越. 基于深度学习的视频人体动作识别综述[J]. 图学学报, 2023, 44(4): 625-639.
[2]	李鑫, 普园媛, 赵征鹏, 徐丹, 钱文华. 内容语义和风格特征匹配一致的艺术风格迁移[J]. 图学学报, 2023, 44(4): 699-709.
[3]	邓渭铭, 杨铁军, 李纯纯, 黄琳. 基于神经网络架构搜索的铭牌目标检测方法[J]. 图学学报, 2023, 44(4): 718-727.
[4]	杨柳, 吴晓群. 基于深度学习的三维形状补全研究综述[J]. 图学学报, 2023, 44(2): 201-215.
[5]	潘东辉, 金映含, 孙旭, 刘玉生, 张东亮. CTH-Net：从线稿和颜色点生成服装图像的CNN-Transformer混合网络[J]. 图学学报, 2023, 44(1): 120-130.
[6]	张盾, 黄志开, 王欢, 吴义鹏, 王颖, 邹家豪. 基于多尺度特征实现超参进化的野生菌分类研究与应用[J]. 图学学报, 2022, 43(4): 580-589.
[7]	廖志伟, 金兢, 张超凡, 杨学志. 基于分层压缩激励的 ASPP 网络单目深度估计[J]. 图学学报, 2022, 43(2): 214-222.
[8]	苏常保, 龚世才. 基于深度学习的人物肖像全自动抠图算法[J]. 图学学报, 2022, 43(2): 247-253.
[9]	何国忠, 梁宇. 基于卷积神经网络的 PCB 缺陷检测[J]. 图学学报, 2022, 43(1): 21-27.
[10]	汪玉金, 谢诚, 余蓓蓓, 向鸿鑫, 柳青. 属性语义与图谱语义融合增强的零次学习图像识别[J]. 图学学报, 2021, 42(6): 899-907.
[11]	张成 , 侯宇超 , 焦宇倩 , 白艳萍 , 李建军 . 基于三通道分离特征融合与支持向量机的混凝土图像分类研究[J]. 图学学报, 2021, 42(6): 917-923.
[12]	马欢, 冀晶晶, 刘佳豪, 刘雨婷. 面向机器人自主分割的肉品识别分类系统实现[J]. 图学学报, 2021, 42(6): 924-930.
[13]	封筠 , 赵颖 , 毕健康 , 赖柏江 , 胡晶晶 . 多级卷积神经网络的沥青路面裂缝图像层次化筛选[J]. 图学学报, 2021, 42(5): 719-728.
[14]	张明华 , 牛玉莹 , 杜艳玲 , 黄冬梅 , 刘刻福 . 基于残差 3DCNN 和三维 Gabor 滤波器的高光谱图像分类[J]. 图学学报, 2021, 42(5): 729-737.
[15]	满开亮, 汪友生, 刘继荣. 基于稠密残差网络的图像超分辨率重建算法[J]. 图学学报, 2021, 42(4): 556-562.

融合注意力机制的肠道息肉分割多尺度卷积神经网络

Multi-scale convolutional neural network incorporating attention mechanism for intestinal polyp segmentation

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

图/表 9

参考文献 14

相关文章 15

编辑推荐

Metrics

本文评价