
Journal of Graphics ›› 2025, Vol. 46 ›› Issue (4): 709-718. DOI: 10.11996/JG.j.2095-302X.2025040709

• Image Processing and Computer Vision •

A post-training quantization method for lightweight convolutional neural networks

YANG Jie1, LI Cong1, HU Qinghao2, CHEN Xianda1, WANG Yunpeng1, LIU Xiaojing1

  1. State Grid Jinan Power Supply Company, Jinan, Shandong 250012, China
    2. The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2024-10-05  Revised: 2024-12-13  Published: 2025-08-30  Online: 2025-08-11
  • Corresponding author: HU Qinghao (1992-), male, associate researcher, Ph.D. His main research interest covers lightweight deep neural networks. E-mail: huqinghao2014@ia.ac.cn
  • First author: YANG Jie (1989-), male, senior engineer, master. His main research interests cover equipment operation and inspection and edge intelligence. E-mail: 18753137902@139.com
  • Supported by:
    The Science and Technology Project of State Grid Shandong Electric Power Company (52060122000Q)

A post-training quantization method for lightweight CNNs

YANG Jie1, LI Cong1, HU Qinghao2, CHEN Xianda1, WANG Yunpeng1, LIU Xiaojing1

  1. State Grid Jinan Power Supply Company, Jinan, Shandong 250012, China
    2. The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2024-10-05  Revised: 2024-12-13  Published: 2025-08-30  Online: 2025-08-11
  • First author: YANG Jie (1989-), senior engineer, master. His main research interests cover equipment operation and inspection and edge intelligence. E-mail: 18753137902@139.com
  • Supported by:
    The Science and Technology Project of State Grid Shandong Electric Power Company (52060122000Q)

Abstract:

Current post-training quantization (PTQ) methods can achieve nearly lossless accuracy at high quantization bit-widths, but for lightweight convolutional neural networks (CNNs) the quantization error remains non-negligible, especially in the case of low bit-width (<4-bit) quantization. To address this problem, a post-training quantization method for lightweight CNNs, the block-level BatchNorm learning (BBL) method, was proposed. Unlike current post-training quantization methods, which fold the batch normalization layers away, this method kept the batch normalization weights on a per-block basis, learned the model's quantization parameters and the batch normalization parameters from a block-level feature-map reconstruction loss, and updated the batch normalization statistics such as the mean and variance, thereby alleviating, in a simple and effective way, the distribution shift that arises when lightweight CNNs are quantized to low bit-widths. In addition, to reduce the overfitting of post-training quantization to the calibration dataset, a block-level data augmentation scheme was constructed so that different model blocks do not learn from the same batch of calibration data. Experiments on the ImageNet dataset show that, compared with current post-training quantization algorithms, the BBL method improves recognition accuracy by up to 7.72 percentage points and effectively reduces the quantization error incurred when lightweight CNNs undergo low-bit post-training quantization.
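To make the block-level procedure above concrete, the following PyTorch sketch keeps a block's BatchNorm layer unfused, learns the weight quantization step size together with the BN affine parameters against the full-precision block's output, and lets the BN running statistics update during calibration. The helper names (fake_quant, QuantConvBNBlock, calibrate_block), the 4-bit setting, and the optimizer choice are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of block-level BatchNorm learning for post-training quantization.
import torch
import torch.nn as nn
import torch.nn.functional as F


def fake_quant(x: torch.Tensor, scale: torch.Tensor, num_bits: int = 4) -> torch.Tensor:
    """Uniform symmetric fake quantization with a straight-through estimator on rounding."""
    qmax = 2 ** (num_bits - 1) - 1
    q = torch.clamp(x / scale, -qmax - 1, qmax)
    q = q + (torch.round(q) - q).detach()  # STE: round in forward, identity in backward
    return q * scale


class QuantConvBNBlock(nn.Module):
    """A quantized block that keeps its BatchNorm layer instead of folding it into the conv."""

    def __init__(self, conv: nn.Conv2d, bn: nn.BatchNorm2d, num_bits: int = 4):
        super().__init__()
        self.conv, self.bn, self.num_bits = conv, bn, num_bits
        # Learnable weight quantization step size, initialized from the weight range.
        init_scale = conv.weight.detach().abs().max() / (2 ** (num_bits - 1) - 1)
        self.w_scale = nn.Parameter(init_scale)

    def forward(self, x):
        w_q = fake_quant(self.conv.weight, self.w_scale, self.num_bits)
        y = F.conv2d(x, w_q, self.conv.bias, self.conv.stride,
                     self.conv.padding, self.conv.dilation, self.conv.groups)
        # BN stays live: in train mode its running mean/var are re-estimated
        # on the quantized activations.
        return self.bn(y)


def calibrate_block(fp_block: nn.Module, q_block: QuantConvBNBlock, calib_batches,
                    steps: int = 200, lr: float = 1e-3):
    """Learns the quantization step size and BN parameters from a block-level
    feature-map reconstruction (MSE) loss against the full-precision block."""
    fp_block.eval()
    q_block.train()  # keep BN running statistics updating during calibration
    optimizer = torch.optim.Adam([q_block.w_scale, *q_block.bn.parameters()], lr=lr)
    for step in range(steps):
        x = calib_batches[step % len(calib_batches)]
        with torch.no_grad():
            target = fp_block(x)               # full-precision block output
        loss = F.mse_loss(q_block(x), target)  # block-level reconstruction loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    return q_block
```

Here fp_block would be a frozen copy of the original convolution-plus-BN block and calib_batches a small list of calibration tensors; keeping q_block in train mode is what re-estimates the BN mean and variance on quantized activations, which is the distribution-shift mitigation the abstract describes.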

Key words: deep neural network compression, post-training quantization, low-bit quantization, lightweight convolutional neural networks, lightweight intelligence

Abstract:

Current post-training quantization methods can achieve nearly lossless quantization at high bit-widths; however, for lightweight convolutional neural networks (CNNs), the quantization error remains non-negligible, especially in the case of low bit-width quantization (<4 bits). To address this, a post-training quantization method for lightweight CNNs, called the block-level BatchNorm learning (BBL) method, was proposed. Unlike current post-training quantization methods that merge the batch normalization layers, this method retained the weights of the batch normalization layers on a per-block basis and learned the quantization parameters and the batch normalization parameters based on a block-level feature map reconstruction loss. It also updated the mean and variance statistics of the batch normalization layers, mitigating the distribution shift caused by low-bit quantization of lightweight CNNs in a simple and effective manner. Furthermore, to reduce overfitting of the post-training quantization method to the calibration dataset, a block-level data augmentation approach was constructed, ensuring that different model blocks did not learn from the same batch of calibration data. Extensive experiments on the ImageNet dataset demonstrated that, compared with current post-training quantization algorithms, the BBL method improved accuracy by up to 7.72 percentage points and effectively reduced the quantization error caused by low-bit post-training quantization of lightweight CNNs.
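The block-level data augmentation idea can be pictured with a small torchvision-based sketch in which each block is calibrated on its own independently augmented and shuffled view of the calibration pool, so no two blocks reconstruct from an identical batch. The transform choices, the per-block seeding, and the helper names (CalibrationSet, calibration_loader_for_block) are assumptions for illustration rather than the paper's exact recipe.

```python
# Hypothetical sketch of block-level data augmentation for the calibration set.
import torch
from torch.utils.data import DataLoader, Dataset
from torchvision import transforms


class CalibrationSet(Dataset):
    """Wraps raw calibration images (e.g., PIL images) with a given transform."""

    def __init__(self, images, transform):
        self.images = images
        self.transform = transform

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        return self.transform(self.images[idx])


def calibration_loader_for_block(images, block_index: int, batch_size: int = 32):
    """Builds a loader whose augmentation and shuffle order depend on the block index,
    so different blocks are never calibrated on an identical batch of data."""
    torch.manual_seed(block_index)  # torchvision's random transforms draw from torch's RNG
    augment = transforms.Compose([
        transforms.RandomResizedCrop(224, scale=(0.5, 1.0)),
        transforms.RandomHorizontalFlip(),
        transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
    ])
    dataset = CalibrationSet(images, augment)
    shuffle_rng = torch.Generator().manual_seed(block_index)
    return DataLoader(dataset, batch_size=batch_size, shuffle=True, generator=shuffle_rng)
```

Combined with the block-wise calibration loop sketched earlier, each block would receive its own loader, which is one simple way to keep a small calibration set from being memorized by every block.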

Key words: deep neural network compression, post-training quantization, low-bit quantization, lightweight convolutional neural networks, lightweight intelligence

CLC number: