Optimization and Behavior Identification of Keyframes in Human Action Video

doi:10.11996/JG.j.2095-302X.2018030463

Journal of Graphics

School of Information Science and Engineering, Guangxi University for Nationalities, Nanning Guangxi 530006, China

Online: 2018-06-30 Published: 2018-07-10
Funding:
Guangxi Natural Science Foundation Project (2015GXNSFAA139311)

Abstract

In behavior identification, extracting keyframes from a video effectively reduces the amount of
video index data and thereby improves both the accuracy and the real-time performance of
identification. To make the keyframes more representative, a method for optimizing the keyframe
sequence is proposed, and behavior identification is performed on that basis. First, the K-means
clustering algorithm is used to extract keyframes from the human action video sequence according
to 3D human skeleton features. Then, a second optimization pass based on the position of each
keyframe in the sequence selects the optimal keyframes, which resolves the keyframe redundancy
found in traditional methods. Finally, a convolutional neural network (CNN) classifier identifies
the behavior video from the optimal keyframes. Experimental results on the Florence 3D Action
dataset show that the method achieves a high identification rate and greatly shortens
identification time compared with traditional methods.

Key words: behavior identification, keyframes, K-means, convolutional neural network
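
The first two stages described in the abstract can be illustrated with a small sketch. The abstract does not state the skeleton feature definition, the value of K, or the exact rule used in the second pass, so everything below is an assumption made for illustration: each frame is a plain feature vector, the keyframe of a cluster is the member frame nearest its centroid, and the position-based pass (the hypothetical `prune_by_position` with a hypothetical `min_gap` parameter) simply drops a keyframe that falls too close in time to the previously kept one. The CNN classification stage is omitted.

```python
import math
import random


def kmeans(points, k, iters=100, seed=0):
    """Plain Lloyd's K-means. Returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iters):
        # Assignment step: each frame goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


def extract_keyframes(frames, k, seed=0):
    """Assumed rule: one keyframe per non-empty cluster, the member
    frame closest to the cluster centroid. Returns sorted frame indices."""
    centroids, labels = kmeans(frames, k, seed=seed)
    keyframes = []
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        if members:
            keyframes.append(
                min(members, key=lambda i: math.dist(frames[i], centroids[c]))
            )
    return sorted(keyframes)


def prune_by_position(keyframes, min_gap):
    """Hypothetical second pass: walk keyframes in temporal order and
    drop any that lies fewer than min_gap frames after the last kept one."""
    kept = []
    for idx in sorted(keyframes):
        if not kept or idx - kept[-1] >= min_gap:
            kept.append(idx)
    return kept


# Synthetic sequence: three poses, each held for ten frames with a tiny wobble.
poses = [(0.0, 0.0, 0.0), (5.0, 5.0, 5.0), (10.0, 0.0, 5.0)]
frames = [
    tuple(x + 0.01 * (i % 3) for x in poses[i // 10])
    for i in range(30)
]
keyframes = extract_keyframes(frames, k=3)
optimal = prune_by_position(keyframes, min_gap=5)
print(keyframes, optimal)
```

Choosing an actual member frame rather than the centroid itself keeps every keyframe a real pose from the video, which matters if the selected frames are later passed to a classifier.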

ZHAO Hong, XUAN Shibin. Optimization and Behavior Identification of Keyframes in Human Action Video[J]. Journal of Graphics, DOI: 10.11996/JG.j.2095-302X.2018030463.

