Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (5): 868-878.DOI: 10.11996/JG.j.2095-302X.2023050868

• Image Processing and Computer Vision • Previous Articles     Next Articles

Lightweight human pose estimation algorithm by integrating CA and BiFPN

PI Jun(), NIU Hou-xing, GAO Zhi-yun()   

  1. School of Transportation Science and Engineering, Civil Aviation University of China, Tianjin 300300, China
  • Received:2023-05-31 Accepted:2023-08-03 Online:2023-10-31 Published:2023-10-31
  • Contact: GAO Zhi-yun (1993-), lecturer, PH.D. Her main research interests cover image processing and pattern recognition. E-mail:zhiyungao@163.com
  • About author:PI Jun (1973-), associate professor, Ph.D. His main research interests cover object detection, image processing and pattern recognition. E-mail:jpi@cauc.edu.cn
  • Supported by:
    China Association of Transport Education Research 2022-2024 Education Science Research Project(JT2022YB325)

Abstract:

To address the problems of existing heatmap-based human pose estimation network models, such as high complexity, intensive computing power requirements, and challenges in deployment on embedded platforms and UAV mobile platforms, a lightweight human pose estimation network was proposed based on YOLOv5s6-Pose-ti-lite without using heatmaps. By replacing the backbone network with GhostNet, it enabled the output of more effective feature information with reduced computing resources. This resulted in faster network detection and alleviated issues related to network redundancy. Within the backbone network, a lightweight coordinate attention (CA) attention module was integrated to gather the position information of human keypoints in the picture to the channel, thus enhancing the ability of feature extraction. BiFPN (weighted bidirectional feature pyramid network) module was introduced to enhance the feature fusion ability of the model and balance the feature information across different scales. Finally, the CIoU loss function was replaced with wise-IoU (WIoU) to enhance the performance of the model for human keypoint regression. The results demonstrated that on the COCO2017 human keypoint dataset, the parameters of the optimized network model were reduced by 26.2%, the calculation was decreased by 30.0%, the average precision was increased by 1.7 percentage points, and the average recall rate was boosted by 2.7 percentage points. These improvements could enable real-time performance, verifying the feasibility and effectiveness of the proposed model.

Key words: human pose estimation, lightweight, coordinate attention, weighted bidirectional feature pyramid network, loss function

CLC Number: