Journal of Graphics ›› 2025, Vol. 46 ›› Issue (2): 358-368. DOI: 10.11996/JG.j.2095-302X.2025020358
• Computer Graphics and Virtual Reality •
ZHANG Xuhui, GUO Yu, HUANG Shaohua, ZHENG Guanguan, TANG Pengzhou, MA Xusheng
Received: 2024-08-06
Accepted: 2024-11-20
Online: 2025-04-30
Published: 2025-04-24
Contact: GUO Yu
About author: ZHANG Xuhui (1999-), master student. His main research interest covers human-machine collaboration. E-mail: xuhuizhang@nuaa.edu.cn
ZHANG Xuhui, GUO Yu, HUANG Shaohua, ZHENG Guanguan, TANG Pengzhou, MA Xusheng. Grasp pose generation for dexterous hand with integrated knowledge transfer[J]. Journal of Graphics, 2025, 46(2): 358-368.
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2025020358
Fig. 10 Mapping rule ((a) Mapping rules based on fingertips; (b) Mapping rules based on fingertips and proximal interphalangeal key points; (c) Optimized mapping rules)
Table 1 Tool size
| Name | Dimensions/mm |
|---|---|
| Hammer | 204×117×23 |
| Hammer_2 | 332×130×33 |
| Power drill | 203×182×57 |
| Power drill_2 | 224×174×90 |
Table 2 Experimental environment configuration
| Item | Specification |
|---|---|
| Programming language | Python 3.8 |
| Deep learning framework | PyTorch 2.0 |
| Operating system | Ubuntu 22.04 |
| CPU | Intel(R) Core(TM) i9-10980XE |
| RAM | 128 GB |
| GPU | NVIDIA GeForce RTX 3090 |
Table 3 Evaluation indicators
| Metric | What it evaluates | Evaluation method |
|---|---|---|
| Hand-object interpenetration volume | Physical plausibility | Voxelize the meshes into 1 mm³ cubes and take the total volume of voxels inside the hand surface as the interpenetration volume |
| Simulated displacement | Grasp stability | Place the object and the predicted grasp in a simulator and measure the average simulated displacement of the object's center of mass under gravity |
| Inter-finger self-collision | Collisions between different hand regions | Partition the hand triangle mesh into 6 regions and exclude face pairs between connected regions, avoiding ambiguity in the collision computation |
| Average maximum penetration depth | Dexterous-hand grasp quality | Select n grasp poses and compute the average maximum penetration depth between the mapped dexterous hand and the tool's convex-hull collider |
| Convergence ratio | Convergence of the mapping rule | Select n grasp poses and count the proportion whose mapping function falls below the threshold within m iterations |
| Grasp pose plausibility | Qualitative assessment of grasp poses | Using the grasp poses under different intents in the training source data as reference, judge whether a generated pose matches the specified intent, grasps an appropriate location, and is visually plausible |
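The interpenetration metric in Table 3 can be sketched in a few lines of Python. This is a minimal illustration of the voxel-counting idea, not the authors' implementation: analytic spheres stand in for the hand and object meshes, and point-inside tests replace true mesh voxelization.

```python
import numpy as np

def interpenetration_volume(inside_hand, inside_object, voxel_mm=1.0):
    """Sum the volume of voxels lying inside both surfaces.

    inside_hand / inside_object: boolean predicates over an (N, 3) array
    of voxel-center coordinates in millimetres. With a 1 mm grid, each
    shared voxel contributes voxel_mm**3 mm^3, as described in Table 3.
    """
    # Build a 1 mm voxel grid over a 40 mm bounding cube.
    axis = np.arange(-20.0, 20.0, voxel_mm)
    xx, yy, zz = np.meshgrid(axis, axis, axis, indexing="ij")
    centers = np.stack([xx, yy, zz], axis=-1).reshape(-1, 3)
    overlap = inside_hand(centers) & inside_object(centers)
    return overlap.sum() * voxel_mm ** 3  # mm^3

# Toy stand-ins: two spheres of radius 10 mm whose centres are 15 mm apart;
# the analytic lens-shaped overlap volume is about 360 mm^3.
hand = lambda p: np.linalg.norm(p, axis=1) < 10.0
obj = lambda p: np.linalg.norm(p - np.array([15.0, 0.0, 0.0]), axis=1) < 10.0

vol = interpenetration_volume(hand, obj)
```

In practice the predicates would come from signed-distance or point-in-mesh queries on the actual hand and tool meshes; the counting step stays the same.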
Table 4 Comparison of grasp pose generation algorithms for hammer
| Model | Intent | Penetration volume/cm³ | Simulated displacement/m | Hand self-collision rate/% |
|---|---|---|---|---|
| GraspTTA | Use | 1.235 | 0.012 | 0 |
| | Pass | 1.150 | 0.011 | 0 |
| IntGen | Use | 0.741 | 0.011 | 0 |
| | Pass | 0.542 | 0.021 | 0 |
| IntContact | Use | 0.654 | 0.009 | 12 |
| | Pass | 0.398 | 0.016 | 40 |
Table 5 Comparison of grasp pose generation algorithms for power drill
| Model | Intent | Penetration volume/cm³ | Simulated displacement/m | Hand self-collision rate/% |
|---|---|---|---|---|
| GraspTTA | Use | 2.692 | 0.017 | 100 |
| | Pass | 2.054 | 0.011 | 0 |
| IntGen | Use | 4.732 | 0.029 | 100 |
| | Pass | 1.407 | 0.022 | 0 |
| IntContact | Use | 1.865 | 0.019 | 0 |
| | Pass | 0.719 | 0.012 | 13 |
Fig. 13 Examples of different algorithms for hammer and drill grasp pose generation under different intents ((a1, a2) GraspTTA_use; (b1, b2) IntGen_use; (c1, c2) IntContact_use; (d1, d2) GraspTTA_pass; (e1, e2) IntGen_pass; (f1, f2) IntContact_pass)
Table 6 Grasp pose generation for hammer_2 in the ablation experiments
| Model | Intent | Penetration volume/cm³ | Simulated displacement/m | Inter-finger self-collision/% |
|---|---|---|---|---|
| IntContact | Use | 0.900 | 0.015 | 58 |
| | Pass | 1.171 | 0.028 | 57 |
| IntContact+Tink | Use | 0.170 | 0.014 | 35 |
| | Pass | 0.280 | 0.023 | 55 |
| IntContact+Tink+Collision (Ours) | Use | 0.169 | 0.013 | 4 |
| | Pass | 0.283 | 0.025 | 10 |
Table 7 Grasp pose generation for power drill_2 in the ablation experiments
| Model | Intent | Penetration volume/cm³ | Simulated displacement/m | Inter-finger self-collision/% |
|---|---|---|---|---|
| IntContact | Use | 2.699 | 0.020 | 37 |
| | Pass | 2.836 | 0.021 | 63 |
| IntContact+Tink | Use | 2.198 | 0.012 | 25 |
| | Pass | 1.192 | 0.011 | 12 |
| IntContact+Tink+Collision (Ours) | Use | 2.271 | 0.014 | 0 |
| | Pass | 1.212 | 0.011 | 4 |
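The inter-finger self-collision metric partitions the hand mesh into 6 regions and skips face pairs between connected regions. A hedged sketch of that bookkeeping, with axis-aligned bounding boxes standing in for triangle-level tests (the region names and adjacency set are illustrative, not taken from the paper):

```python
import itertools

# Illustrative 6-region split; ADJACENT lists region pairs that share a
# joint and are therefore excluded from the collision test.
ADJACENT = {("palm", "thumb"), ("palm", "index"), ("palm", "middle"),
            ("palm", "ring"), ("palm", "little")}

def aabb_overlap(a, b):
    """True if two axis-aligned boxes ((min_xyz), (max_xyz)) intersect."""
    return all(a[0][i] <= b[1][i] and b[0][i] <= a[1][i] for i in range(3))

def self_collisions(regions):
    """Return colliding region pairs, skipping connected (adjacent) ones."""
    hits = []
    for (na, ba), (nb, bb) in itertools.combinations(regions.items(), 2):
        if (na, nb) in ADJACENT or (nb, na) in ADJACENT:
            continue  # connected regions always touch; exclude them
        if aabb_overlap(ba, bb):
            hits.append((na, nb))
    return hits

# Toy hand configuration (mm): index and middle fingers cross each other.
regions = {
    "palm":   ((0, 0, 0), (80, 90, 30)),
    "thumb":  ((-30, 0, 0), (5, 40, 25)),
    "index":  ((10, 90, 0), (25, 160, 20)),
    "middle": ((20, 90, 0), (40, 165, 20)),   # overlaps index -> collision
    "ring":   ((50, 90, 0), (65, 155, 20)),
    "little": ((70, 90, 0), (80, 140, 20)),
}
hits = self_collisions(regions)
```

A real implementation would test the triangle faces of each region rather than bounding boxes, but the adjacency exclusion works the same way.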
Fig. 14 Example of grasp pose of hammer_2 and drill_2 in ablation experiments ((a1, a2) IntContact_use; (b1, b2) IntContact+Tink_use; (c1, c2) Ours_use; (d1, d2) IntContact_pass; (e1, e2) IntContact+Tink_pass; (f1, f2) Ours_pass)
Table 8 Dexterous hand grasping hammer_2 under different mapping rules
| Mapping rule | Intent | Average maximum penetration depth/mm | Convergence ratio/% |
|---|---|---|---|
| Fingertips | Use | 11.6 | 10 |
| | Pass | 17.8 | 52 |
| Fingertips and proximal interphalangeal key points | Use | 5.4 | 76 |
| | Pass | 10.8 | 94 |
| Optimized | Use | 5.3 | 90 |
| | Pass | 9.9 | 94 |
Table 9 Dexterous hand grasping drill_2 under different mapping rules
| Mapping rule | Intent | Average maximum penetration depth/mm | Convergence ratio/% |
|---|---|---|---|
| Fingertips | Use | 29.2 | 0 |
| | Pass | 16.6 | 84 |
| Fingertips and proximal interphalangeal key points | Use | 4.2 | 100 |
| | Pass | 13.9 | 86 |
| Optimized | Use | 3.1 | 100 |
| | Pass | 12.4 | 90 |
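The convergence ratio reported in Tables 8-9 counts, over n grasp poses, the share whose mapping objective drops below a threshold within m iterations. A minimal sketch (the traces and threshold below are made up for illustration):

```python
def convergence_ratio(traces, threshold, m):
    """Percentage of grasps whose mapping objective falls below
    `threshold` within the first `m` iterations of its trace."""
    converged = sum(
        1 for trace in traces
        if any(value < threshold for value in trace[:m])
    )
    return 100.0 * converged / len(traces)  # percent, as in Tables 8-9

# Three toy optimization traces of the mapping objective.
traces = [
    [9.0, 4.0, 1.5, 0.8],   # drops below 1.0 at iteration 4
    [8.0, 6.0, 5.0, 4.5],   # never below the threshold
    [3.0, 0.9, 0.5, 0.4],   # drops below 1.0 at iteration 2
]
ratio = convergence_ratio(traces, threshold=1.0, m=4)  # 2 of 3 converge
```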
Fig. 15 Examples of the dexterous hand grasping hammer_2 and drill_2 under different mapping rules ((a1, a2) A_use; (b1, b2) B_use; (c1, c2) C_use; (d1, d2) A_pass; (e1, e2) B_pass; (f1, f2) C_pass)
Fig. 16 Examples of different brands of dexterous hand grasping hammer_2 and drill_2 under mapping rule (c) ((a1, a2) Schunk_use; (b1, b2) Shadow_use; (c1, c2) Ability_use; (d1, d2) Schunk_pass; (e1, e2) Shadow_pass; (f1, f2) Ability_pass)