Human action recognition based on LSTM neural network

doi:10.11996/JG.j.2095-302X.2021020174

Journal of Graphics, 2021, Vol. 42, Issue (2): 174-181. DOI: 10.11996/JG.j.2095-302X.2021020174

• Image Processing and Computer Vision •

School of Mechanical and Instrumental Engineering, Xi’an University of Technology, Xi’an, Shaanxi 710048, China

Online: 2021-04-30 Published: 2021-04-30
Supported by:
National Natural Science Foundation of China (51475365); Natural Science Basic Research Program of Shaanxi Province (2017JM5088)

Abstract

Abstract: Human action recognition underpins human-machine collaboration: a robot that recognizes and understands the operator's actions can improve the flexibility and production efficiency of a manufacturing system. This work addresses human action recognition from 3D skeleton data. The raw 3D skeleton data are first smoothed and denoised so that they conform to the smooth motion of human joints. A fusion feature composed of static and dynamic features is constructed to represent human actions. A key-frame extraction model is introduced to extract the key frames of each action sequence and thereby reduce the computational load. A Bi-LSTM classification model, built on the LSTM neural network, is established; an attention mechanism and Dropout are introduced for action classification, and the main network parameters are optimized with the orthogonal test method. Finally, action recognition experiments are carried out on a public dataset. The results show that the proposed model achieves a high recognition rate for human actions.

Key words: action recognition, fusion features, LSTM neural network, attention mechanism, Dropout
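The preprocessing stages named in the abstract (smoothing, fused static and dynamic features, key-frame extraction) can be illustrated with a minimal sketch. The abstract does not specify the filter, the feature definitions, or the key-frame criterion, so everything below is an assumption made for illustration: a moving-average filter, root-relative joint positions as the static part, frame-to-frame displacement as the dynamic part, and a "largest motion" rule for key frames. The function names `smooth`, `fused_features`, and `key_frames` are hypothetical and are not taken from the paper.

```python
from math import dist

# A skeleton sequence is a list of frames; each frame is a list of
# (x, y, z) joint tuples. All design choices here are illustrative
# assumptions, not the method described in the paper.

def smooth(seq, window=3):
    """Moving-average filter over time, applied to every joint coordinate."""
    half = window // 2
    out = []
    for t in range(len(seq)):
        lo, hi = max(0, t - half), min(len(seq), t + half + 1)
        n = hi - lo  # the window shrinks at the sequence boundaries
        frame = [
            tuple(sum(seq[k][j][c] for k in range(lo, hi)) / n for c in range(3))
            for j in range(len(seq[t]))
        ]
        out.append(frame)
    return out

def fused_features(seq, root=0):
    """Per-frame vector: static part (positions relative to a root joint)
    followed by dynamic part (displacement from the previous frame)."""
    feats = []
    for t, frame in enumerate(seq):
        rx, ry, rz = frame[root]
        static = [v for (x, y, z) in frame for v in (x - rx, y - ry, z - rz)]
        prev = seq[t - 1] if t > 0 else frame  # zero displacement at t = 0
        dynamic = [a - b for p, q in zip(frame, prev) for a, b in zip(p, q)]
        feats.append(static + dynamic)
    return feats

def key_frames(seq, k):
    """Indices of the k frames with the largest total joint displacement
    from the preceding frame, returned in temporal order."""
    energy = [0.0] + [
        sum(dist(p, q) for p, q in zip(seq[t], seq[t - 1]))
        for t in range(1, len(seq))
    ]
    top = sorted(range(len(seq)), key=lambda t: energy[t], reverse=True)[:k]
    return sorted(top)
```

Under these assumptions, the feature vectors of the selected key frames would form the shortened input sequence passed to the Bi-LSTM classifier; selecting fewer frames trades temporal detail for a lower computational load.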

CLC number: TP 391.4

YANG Shi-qiang, YANG Jiang-tao, LI Zhuo, WANG Jin-hua, LI De-xin. Human action recognition based on LSTM neural network[J]. Journal of Graphics, 2021, 42(2): 174-181.

