Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (3): 531-539.DOI: 10.11996/JG.j.2095-302X.2023030531

Previous Articles     Next Articles

Semantic segmentation with fusion of spatial criss-cross and channel multi-head attention

WU Wen-huan(), ZHANG Hao-kun   

  1. School of Electrical and Information Engineering, Hubei University of Automotive Technology, Shiyan Hubei 442002, China
  • Received:2022-10-05 Accepted:2023-02-22 Online:2023-06-30 Published:2023-06-30
  • About author:

    WU Wen-huan (1985-), associate professor, Ph.D. His main research interests cover computer vision and image processing, etc. E-mail:wuwenhuan5@163.com

  • Supported by:
    Natural Science Fund Project of Hubei Province(2022CFB538);Ph.D Research Startup Fund Project of Hubei University of Automotive Technology(BK202004)

Abstract:

In light of the shortcomings of current semantic segmentation methods, which suffer from ineffective construction of contextual semantic associations and insufficient representation of extracted semantic features, a novel semantic segmentation network that combines spatial criss-cross attention and channel attention was proposed. Firstly, the spatial criss-cross attention module (SCCAM) was adopted to aggregate context information of each target pixel in the horizontal and vertical directions, thus enabling efficient construction of non-local semantic dependencies between pixels. Secondly, the multi-head attention mechanism was introduced in the channel attention module (CAM) to mine channel features with more significant semantics on multiple channel subspaces. Finally, the semantic representation capability was strengthened by merging attention features on both spatial and channel dimensions, thereby improving the precision of semantic segmentation. The experimental results on several datasets, including Cityscapes, PASCAL VOC2012, and CamVid demonstrated that the proposed network model outperformed other state-of-the-art semantic segmentation methods in terms of segmentation accuracy.

Key words: semantic segmentation, neural networks, attention mechanism, space attention, channel attention

CLC Number: