Face detection and embedded implementation of lightweight network

Journal of Graphics, 2022, Vol. 43, Issue (2): 239-246. DOI: 10.11996/JG.j.2095-302X.2022020239

• Image Processing and Computer Vision •

1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;
2. Shenzhen Bryture Co. Ltd., Shenzhen, Guangdong 518000, China;
3. School of Mechanical Engineering, Guizhou University, Guiyang, Guizhou 550025, China

Online: 2022-04-30 Published: 2022-05-07
Supported by:
Fundamental Research Funds for the Central Universities (2021YJS025);

National Natural Science Foundation of China (62062021, 61872034, 62011530042);

Beijing Municipal Natural Science Foundation (4202055);

Guangxi Natural Science Foundation (2018GXNSFBA281086)

Abstract

In recent years, face detectors based on convolutional neural networks (CNN) have come to dominate the field, and detection results on public benchmarks have improved significantly. However, computational cost and model complexity keep rising, and applying face detection models to embedded devices with limited computing power and memory remains a challenge. To address face detection on 320×240 resolution input images in embedded systems, a low-resolution face detection algorithm based on a lightweight network was proposed. The algorithm employed an attention mechanism, combined Distance-IoU (DIoU) with non-maximum suppression (NMS), adopted the Mish activation function, and set prior boxes suited to the proportions of faces. In doing so, a balance between accuracy and speed was achieved, and the model was deployed on an embedded platform. Specifically, depthwise separable convolution was used to replace ordinary convolution, and a convolutional block attention module (CBAM) was added after the convolution block so that the network focuses on the target objects to be recognized. The Mish activation function was used in place of ReLU to improve model inference speed. Combining DIoU with NMS enhanced the detection of small faces. Experimental results on the WIDER FACE dataset show that the proposed method not only detects faces in real time with high accuracy, but also achieves higher accuracy than traditional algorithms on low-resolution input. After the dataset was expanded, the model also generalized better under complex illumination.

Keywords: face detection, lightweight network, attention mechanism, activation function, non-maximum suppression
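
The abstract names the combination of DIoU and NMS but does not give its formulation. As a rough illustration only, the sketch below follows the commonly used definition of DIoU-NMS, in which the overlap test of standard NMS is replaced by DIoU = IoU − d²/c², where d is the distance between box centers and c is the diagonal of the smallest box enclosing both. The function names `diou` and `diou_nms`, the `[x1, y1, x2, y2]` box layout, and the threshold of 0.5 are assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def diou(box, boxes, eps=1e-9):
    """DIoU between one box and an array of boxes, all as [x1, y1, x2, y2]."""
    # Intersection area
    ix1 = np.maximum(box[0], boxes[:, 0])
    iy1 = np.maximum(box[1], boxes[:, 1])
    ix2 = np.minimum(box[2], boxes[:, 2])
    iy2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(ix2 - ix1, 0, None) * np.clip(iy2 - iy1, 0, None)

    # Union area and plain IoU
    area_a = (box[2] - box[0]) * (box[3] - box[1])
    area_b = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    iou = inter / (area_a + area_b - inter + eps)

    # Squared distance between box centers
    d2 = ((box[0] + box[2]) / 2 - (boxes[:, 0] + boxes[:, 2]) / 2) ** 2 \
       + ((box[1] + box[3]) / 2 - (boxes[:, 1] + boxes[:, 3]) / 2) ** 2

    # Squared diagonal of the smallest enclosing box
    ex1 = np.minimum(box[0], boxes[:, 0])
    ey1 = np.minimum(box[1], boxes[:, 1])
    ex2 = np.maximum(box[2], boxes[:, 2])
    ey2 = np.maximum(box[3], boxes[:, 3])
    c2 = (ex2 - ex1) ** 2 + (ey2 - ey1) ** 2

    return iou - d2 / (c2 + eps)


def diou_nms(boxes, scores, threshold=0.5):
    """Return indices of kept boxes, highest score first (threshold is an assumed value)."""
    boxes = np.asarray(boxes, dtype=np.float64)
    scores = np.asarray(scores, dtype=np.float64)
    order = np.argsort(-scores)  # candidate indices sorted by descending score
    keep = []
    while order.size > 0:
        best = order[0]
        keep.append(int(best))
        rest = order[1:]
        if rest.size == 0:
            break
        # Suppress candidates whose DIoU with the kept box exceeds the threshold
        order = rest[diou(boxes[best], boxes[rest]) <= threshold]
    return keep
```

For example, `diou_nms([[0, 0, 10, 10], [1, 1, 11, 11], [20, 20, 30, 30]], [0.9, 0.8, 0.7])` returns `[0, 2]`: the second box nearly coincides with the first and is suppressed, while the distant third box is kept. Because the center-distance term lowers the score of boxes whose centers are far apart, two overlapping detections with clearly separated centers are less likely to suppress each other than under plain IoU, which is one plausible reason this variant helps with small, closely spaced faces.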

CLC Number: TP 751.1

ZHANG Ming, ZHANG Fang-hui, ZONG Jia-ping, SONG Zhi, CEN Yi-gang, ZHANG Lin-na. Face detection and embedded implementation of lightweight network[J]. Journal of Graphics, 2022, 43(2): 239-246.

