欢迎访问《图学学报》 分享到:

图学学报 ›› 2022, Vol. 43 ›› Issue (5): 892-900.DOI: 10.11996/JG.j.2095-302X.2022050892

• 计算机图形学与虚拟现实 • 上一篇    下一篇

一种基于改进 PointNet++网络的 三维手姿估计方法

  

  1. 北方工业大学信息学院,北京 100144
  • 出版日期:2022-10-31 发布日期:2022-10-28
  • 基金资助:
    北京市科学基金一般项目(18YTC038);北京市自然科学基金青年基金项目(4194076);北京市教委科研计划一般项目(KM201910009014) 

A 3D hand pose estimation method based on improved PointNet++ 

  1. School of Information, North China University of Technology, Beijing 100144, China
  • Online:2022-10-31 Published:2022-10-28
  • Supported by:
    General Project of Beijing Science Foundation (18YTC038); Youth Fund Project of Beijing Natural Science Foundation (4194076); General Project of Scientific Research Plan of Beijing Municipal Commission of Education (KM201910009014) 

摘要:

针对 PointNet++网络处理点云局部特征时因分组范围区过大导致计算量较大的问题,提出一种改 进的 PointNet++网络的三维手姿估计方法。首先对手势点云进行基于 Delaunay 三角剖分算法与 K 中位数聚类算 法相结合的三角剖分,得到手势点云的三角网格模型,并计算三角网格模型的边长均值;然后以三角网格模型边 长均值为半径,对最远点采样(FPS)的采样点进行球查询搜索,再根据搜索到的采样点个数极值对采样点云进行 K 近邻分组,并最终输入 PointNet 网络,完成三维手姿的位置估计。改进后的 PointNet++网络可以根据不同的点云 密度自动调整网络分组区域的局部提取点个数。实验结果表明,在不影响三维手姿估计精度的情况下,该方法提 高了 PointNet++网络的模型训练速度,并在三维手姿估计中可有效减少特征提取的计算量,使计算机能够更快地 捕捉手姿状态。

关键词: 三维手姿估计, PointNet++, Delaunay 三角剖分, 球查询搜索, K 近邻搜索

Abstract:

To address the problem that the processing of local features of point cloud in PointNet++ network sometimes results in a large amount of computation due to the large grouping range, a 3D hand pose estimation method based on the improved PointNet++ network was proposed. Firstly, the gesture point cloud was triangulated based on the combination of Delaunay triangulation algorithm and K-Median clustering algorithm, thus creating the triangular mesh model of the gesture point cloud. Simultaneously the average edge length of the triangular mesh model was calculated. Then, with the average edge length of the triangular mesh model as the radius, the points sampled by the farthest point sampling (FPS) algorithm were searched by ball query. Then the sampled point cloud was grouped by K-Nearest Neighbors algorithm according to the maximum value of the searched sampled points. Finally, the grouped point cloud was input into the PointNet to perform the 3D hand pose estimation. The improved PointNet++ network can automatically adjust the number of local abstraction points of point cloud grouping according to point cloud density at different levels. Experiments show that, without affecting the accuracy of 3D hand pose estimation, the proposed method can enhance the training speed of PointNet++, as well as effectively reducing the  computation of feature extraction in 3D hand pose estimation, so that the computer can capture the hand pose more quickly. 

Key words:  , 3D hand pose estimation, PointNet++, Delaunay triangulation, ball query search, K-nearest neighbor search 

中图分类号: