Segmentation of laser coding characters based on residual and feature-grouped attention

doi:10.11996/JG.j.2095-302X.2023030482

Abstract

Abstract:

Laser coding on metal surface can lead to denaturation of surrounding metal and generate a significant amount of noise in the form of burns. This results in complex backgrounds in the character region, low contrast, and ambiguity of characters, which can make subsequent character recognition challenging. In response, Res18-UNet, a novel laser coding character feature enhancement and fine segmentation model, was proposed. The proposed model was based on residual and feature-grouped attention to highlight character information and improve signal-to-noise ratio, thus effectively segmenting the target. Firstly, the A-R unit was designed to reduce network parameters, effectively avoid network degradation, and improve the feature selection ability in channels and spaces. Secondly, the feature-grouped attention mechanism was proposed, and the improved spatial attention was added to enhance weak character features. In addition, a deep supervision module integrating the improved loss function was designed in the upsampling stage to improve network convergence and enhance segmentation precision. According to the experiment on the image dataset of the can bottoms with laser coding, the proposed model outperformed the original UNet model in terms of mIoU, Dice coefficient, and F1 score. Specifically, the proposed model achieved 0.801 0, 0.889 5, and 0.903 5, respectively, and attained the prediction speed 2.6 times that of the original UNet at 12.24 images/s. Experiments have proven that this algorithm can effectively enhance the features of low contrast laser coded characters and segment them with high precision, and that it has the feasibility and application prospect of deployment and operation on embedded platforms.

Key words: laser coding characters, image segmentation, spatial attention mechanism, residual neural network, feature group strategy

CLC Number:

TP391

XIAO Tian-xing, WU Jing-jing. Segmentation of laser coding characters based on residual and feature-grouped attention[J]. Journal of Graphics, 2023, 44(3): 482-491.

Figures/Tables 14

References 22

[1]	孙晓娜, 刘继超, 高国华. 基于视觉的乳品包装日期喷码缺陷检测技术[J]. 食品与机械, 2018, 34(10): 100-103, 108.
	SUN X N, LIU J C, GAO G H. Study on visual code-based defect detection technology for production date of dairy packaging[J]. Food & Machinery, 2018, 34(10): 100-103, 108. (in Chinese)
[2]	马玲, 罗晓曙, 蒋品群. 基于模板匹配和支持向量机的点阵字符识别研究[J]. 计算机工程与应用, 2020, 56(4): 134-139. DOI
	MA L, LUO X S, JIANG P Q. Research on dot matrix character recognition based on template matching and support vector machine[J]. Computer Engineering and Applications, 2020, 56(4): 134-139. (in Chinese) DOI
[3]	林冬婷, 程洋, 欧阳, 等. 基于喷点融合特征的点阵字符分割方法[J]. 制造业自动化, 2021, 43(8): 52-57.
	LIN D T, CHENG Y, OU Y, et al. Detection method of dot matrix character based on spray fusion feature[J]. Manufacturing Automation, 2021, 43(8): 52-57. (in Chinese)
[4]	张家财, 张良力, 曾飞. 钢坯表面点印字符图像自适应阈值分割方法[J]. 现代电子技术, 2021, 44(19): 49-54.
	ZHANG J C, ZHANG L L, ZENG F. Adaptive thresholding segmentation method for dot-printed character image on billet surface[J]. Modern Electronics Technique, 2021, 44(19): 49-54. (in Chinese)
[5]	汤勃, 孔建益, 王兴东, 等. 钢板表面低对比度微小缺陷图像增强和分割[J]. 中国图象图形学报, 2020, 25(1): 81-91.
	TANG B, KONG J Y, WANG X D, et al. Image enhancement and segmentation algorithm for low-contrast small defects on steel plate[J]. Journal of Image and Graphics, 2020, 25(1): 81-91. (in Chinese)
[6]	BARTHAKUR M, SARMA K K. Complex image Segmentation using K-means clustering aided neuro-computing[C]// The 5th International Conference on Signal Processing and Integrated Networks. New York: IEEE Press, 2018: 327-331.
[7]	李镇锋, 陈晓荣, 陈梦华, 等. 基于图像熵和傅里叶变换的复杂背景分割[J]. 软件工程, 2021, 24(11): 19-23.
	LI Z F, CHEN X R, CHEN M H, et al. Complex background segmentation based on image entropy and Fourier transform[J]. Software Engineering, 2021, 24(11): 19-23. (in Chinese)
[8]	王欣, 徐平平, 吴菲. 基于指数同态滤波耦合细节锐化规则的红外图像增强算法[J]. 电子测量与仪器学报, 2021, 35(10): 9-16.
	WANG X, XU P P, WU F. Infrared image enhancement algorithm based on exponential homomorphic filtering coupled with detail sharpening rule[J]. Journal of Electronic Measurement and Instrumentation, 2021, 35(10): 9-16. (in Chinese)
[9]	SHEN L, YUE Z H, FENG F, et al. MSR-net: low-light image enhancement using deep convolutional network[EB/OL]. (2017-11-07) [2022-06-15]. https://arxiv.org/abs/1711.02488.
[10]	QIN X B, FAN D P, HUANG C Y, et al. Boundary-aware segmentation network for mobile and web applications[EB/OL]. (2021-05-11) [2022-03-10]. https://arxiv.org/abs/2101.04704.
[11]	HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2016: 770-778.
[12]	RONNEBERGER O, FISCHER P, BROX T. U-net: convolutional networks for biomedical image segmentation[M]// Lecture Notes in Computer Science. Cham: Springer International Publishing, 2015: 234-241.
[13]	JÉGOU S, DROZDZAL M, VAZQUEZ D, et al. The one hundred layers tiramisu: fully convolutional DenseNets for semantic segmentation[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops. New York: IEEE Press, 2017: 1175-1183.
[14]	UPADHYAY K, AGRAWAL M, VASHIST P. U-net based multi-level texture suppression for vessel segmentation in low contrast regions[C]// The 28th European Signal Processing Conference. New York: IEEE Press, 2021: 1304-1308.
[15]	董月, 冯华君, 徐之海, 等. Attention Res-Unet: 一种高效阴影检测算法[J]. 浙江大学学报: 工学版, 2019, 53(2): 373-381, 406.
	DONG Y, FENG H J, XU Z H, et al. Attention Res-Unet: an efficient shadow detection algorithm[J]. Journal of Zhejiang University: Engineering Science, 2019, 53(2): 373-381, 406. (in Chinese)
[16]	KAMRAN S A, HOSSAIN K F, TAVAKKOLI A, et al. RV-GAN: segmenting retinal vascular structure in fundus photographs using a novel multi-scale generative adversarial network[M]//Medical Image Computing and Computer Assisted Intervention - MICCAI 2021. Cham: Springer International Publishing, 2021: 34-44.
[17]	WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[M]//Computer Vision - ECCV 2018. Cham: Springer International Publishing, 2018: 3-19.
[18]	蒋宏达, 叶西宁. 一种改进的I-Unet网络的皮肤病图像分割算法[J]. 现代电子技术, 2019, 42(12): 52-56.
	JIANG H D, YE X N. An improved skin disease image segmentation algorithm based on I-Unet network[J]. Modern Electronics Technique, 2019, 42(12): 52-56. (in Chinese)
[19]	侯向丹, 赵一浩, 刘洪普, 等. 融合残差注意力机制的UNet视盘分割[J]. 中国图象图形学报, 2020, 25(9): 1915-1929.
	HOU X D, ZHAO Y H, LIU H P, et al. Optic disk segmentation by combining UNet and residual attention mechanism[J]. Journal of Image and Graphics, 2020, 25(9): 1915-1929. (in Chinese)
[20]	SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-CAM: visual explanations from deep networks via gradient-based localization[C]// 2017 IEEE International Conference on Computer Vision. New York: IEEE Press, 2017: 618-626.
[21]	ZHOU Z W, RAHMAN SIDDIQUEE M M, TAJBAKHSH N, et al. UNet++: A nested U-net architecture for medical image segmentation[M]//Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer International Publishing, 2018: 3-11.
[22]	钟文煜, 冯寿廷. 改进型Unet: 一种高效准确的视网膜血管分割方法[J]. 光学技术, 2019, 45(6): 744-748.
	ZHONG W Y, FENG S T. Improved Unet: an efficient and accurate retinal vessel segmentation method[J]. Optical Technique, 2019, 45(6): 744-748. (in Chinese)

主干网络	mIoU	Dice	F1 Score	FPS
ResNet14	0.788 0	0.881 5	0.894 8	13.70
ResNet18	0.801 0	0.889 5	0.903 5	12.24
ResNet34	0.792 8	0.883 8	0.897 3	8.80

主干网络	mIoU	Dice	F1 Score	FPS
ResNet14	0.788 0	0.881 5	0.894 8	13.70
ResNet18	0.801 0	0.889 5	0.903 5	12.24
ResNet34	0.792 8	0.883 8	0.897 3	8.80

注意力模块	mIoU	Dice	F1 Score
None	0.785 0	0.878 6	0.889 5
CBAM	0.790 6	0.880 9	0.892 8
Proposed	0.801 0	0.889 5	0.903 5

注意力模块	mIoU	Dice	F1 Score
None	0.785 0	0.878 6	0.889 5
CBAM	0.790 6	0.880 9	0.892 8
Proposed	0.801 0	0.889 5	0.903 5

方法	mIoU	Dice	F1 Score	收敛轮数
BCE Loss	0.795 9	0.884 7	0.897 4	-
BID Loss (1:1:1)	0.786 7	0.881 2	0.894 4	-
BID Loss (2:1:1)	0.800 4	0.890 1	0.903 2	102
BID Loss (2:1:1) +DS	0.801 0	0.889 5	0.903 5	56