Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (1): 50-58.DOI: 10.11996/JG.j.2095-302X.2023010050

• Image Processing and Computer Vision • Previous Articles     Next Articles

Multi-scale convolutional neural network incorporating attention mechanism for intestinal polyp segmentation

SHAN Fang-mei1(), WANG Meng-wen1, LI Min1,2()   

  1. 1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing Jiangsu 210094, China
    2. Shandong Key Laboratory of Digital Medicine and Computer Assisted Surgery, Qingdao Shandong 266003, China
  • Received:2022-03-24 Revised:2022-08-05 Online:2023-10-31 Published:2023-02-16
  • Contact: LI Min
  • About author:SHAN Fang-mei (1997-), master student. Her main research interest covers image segmentation. E-mail:1094264762@qq.com
  • Supported by:
    National Natural Science Foundation of China(61501241);National Natural Science Foundation of Jiangsu Province(BK20150792);Foundation of Shandong Provincial Key Laboratory of Digital Medicine and Computer Assisted Surgery(SDKL-DMCAS-2018-04);Transport Science and Technology Project of Jiangsu Province(2021Y)

Abstract:

Intestinal polyp segmentation provides the location and morphology of polyps in colon, allowing doctors to infer the possibility of canceration according to the degree of structural deformation, which facilitates the early diagnosis and treatment of colon cancer. In view of the limited multi-scale features extracted by many existing convolutional neural networks (CNN), and the frequently caused redundant and interfering features, it is difficult to extract the complex and variable targets. To address this challenge, a multi-scale convolutional neural network incorporating attention mechanism was proposed for intestinal polyp segmentation. Specifically, the pyramid strategy based on different scales of pooling was designed to capture the rich multi-scale context information. Then a channel attention mechanism was incorporated into the network so that the model could adaptively select appropriate local and global contextual information for feature integration based on the region of interest. Following that, by combining the pyramid pooling strategy and the channel attention mechanism, a multi-scale effective semantic fusion decoder network was constructed to improve the model robustness for segmentation of intestinal polyps with complex and variable shapes and sizes. The experimental results show that the Dice coefficient, IoU, and sensitivity produced by the proposed model reach 90.6%, 84.4%, and 91.1% on the CVC-ClinicDB dataset, and 80.6%, 72.6%, and 79.0% on the ETIS-Larib dataset, indicating that the proposed model could accurately and effectively segments polyps in colonoscopy images.

Key words: polyp segmentation, colonoscopy images, convolutional neural network, multiscale semantic information

CLC Number: