Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2022, Vol. 43 ›› Issue (2): 239-246.DOI: 10.11996/JG.j.2095-302X.2022020239

• Image Processing and Computer Vision • Previous Articles     Next Articles

Face detection and embedded implementation of lightweight network

  

  1. 1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;
    2. Shenzhen Bryture Co. Ltd., Shenzhen Guangdong 518000, China;
    3. School of Mechanical Engineering, Guizhou University, Guiyang Guizhou 550025, China
  • Online:2022-04-30 Published:2022-05-07
  • Supported by:

    Fundamental Research Funds for the Central Universities (2021YJS025); 

    National Natural Science Foundation of China under Grant (62062021, 61872034, 62011530042); 

    Beijing Municipal Natural Science Foundation under Grant (4202055); 

    Guangxi Natural Science Foundation under Grant (2018GXNSFBA281086)


Abstract:  In recent years, face detection based on convolutional neural networks (CNN) has dominated this field, and
the detection results on the public benchmark set have also been significantly improved. However, the computational
cost and model complexity are on the rise. It remains a challenge to apply face detection model to embedded devices
with limited computing power and memory capacity. Aiming at the application of face detection of 320×240
resolution input images in embedded systems, a low-resolution face detection algorithm based on lightweight network
was proposed. The backbone network employed the attention module, combined Distance-IoU (DIoU) and Non-Maximum Suppression (NMS), and adopted the Mish activation function. Meanwhile, an appropriate a priori box
was set for the face feature ratio. In doing so, the balance could be achieved between precision and speed, and it could
be deployed to the embedded platform. Specifically, deep separable convolution was used to replace ordinary
convolution, and an attention convolutional block attention module (CBAM) was added after the convolution block to
keep the network’s focus on the target object to be recognized. Instead of the ReLU activation function, the Mish
activation function was used to improve the model inference speed. By combining DIoU and NMS, the algorithm’s
detection accuracy for small faces was enhanced. The results of experiments on the WIDER FACE dataset prove that
the proposed method not only can detect human faces with high accuracy in real time, but also has higher accuracy
than traditional algorithms in small resolution input. After expanding the dataset, the proposed model also improves
the detection accuracy under complex illuminations.

Key words: face detection, lightweight network, attention module, activation function, non-maximum suppression

CLC Number: