基于人体姿态估计与聚类的特定运动帧获取方法

doi:10.11996/JG.j.2095-302X.2022010044

图学学报 ›› 2022, Vol. 43 ›› Issue (1): 44-52.DOI: 10.11996/JG.j.2095-302X.2022010044

• 图像处理与计算机视觉 • 上一篇下一篇

基于人体姿态估计与聚类的特定运动帧获取方法

上海师范大学信息与机电工程学院，上海 200234

出版日期:2022-02-28 发布日期:2022-02-16
基金资助:
国家自然科学基金项目(61775139)；上海市地方能力建设项目(19070502900)

Acquisition method of specific motion frame based on human attitude estimation and clustering

School of Information and Electromechanical Engineering, Shanghai Normal University, Shanghai 200234, China

Online:2022-02-28 Published:2022-02-16
Supported by:
National Natural Science Foundation of China (61775139); Shanghai Local Capacity Building Project (19070502900)

摘要/Abstract

摘要： 运动视频中特定运动帧的获取是运动智能化教学实现的重要环节，为了得到视频中的特定运动帧以便进一步地对视频进行分析，并利用姿态估计和聚类的相关知识，提出了一种对运动视频提取特定运动帧的方法。首先选用 HRNet 姿态估计模型作为基础，该模型精度高但模型规模过大，为了实际运用的需求，对该模型进行轻量化处理并与 DARK 数据编码相结合，提出了 Small-HRNet 网络模型，在基本保持精度不变的情况下参数量减少了 82.0%。然后利用 Small-HRNet 模型从视频中提取人体关节点，将每一视频帧中的人体骨架特征作为聚类的样本点，最终以标准运动帧的骨架特征为聚类中心，对整个视频进行聚类得到视频的特定运动帧，在武术运动数据集上进行实验。该方法对武术动作帧的提取准确率为 87.5%，能够有效地提取武术动作帧。

关键词: 特定运动帧, 姿态估计, 数据编解码, 运动特征, 聚类

Abstract: The acquisition of specific motion frames in motion video was an important part of intelligent teaching. In order to obtain specific motion frames in video for further analysis, a method of extracting specific motion frames from motion video was proposed using the knowledge of pose estimation and clustering. Firstly, the HRNet attitude estimation model was adopted as the basis, which was of high precision but large scale. To meet the needs of practical application, this paper proposed a Small-HRNet network model by combining it with the data encoding of DARK. The parameters were reduced by 82.0% while the precision was kept unchanged. Then, the Small-HRNet model was employed to extract human joint points from the video. The human skeleton feature in each video frame served as the sample point of clustering, and finally the whole video was clustered by the skeleton feature of the standard motion frame as the clustering center to produce the specific motion frame of the video. The experiment was carried out on the martial arts data set, and the accuracy rate of the martial arts action frame extraction was 87.5%, which can effectively extract the martial arts action frame.

Key words: specific motion frame, attitude estimation, data encoding and decoding, movement characteristics, clustering

中图分类号:

TP 391

蔡敏敏, 黄继风, 林晓, 周小平. 基于人体姿态估计与聚类的特定运动帧获取方法[J]. 图学学报, 2022, 43(1): 44-52.

CAI Min-min, HUANG Ji-feng, LIN Xiao, ZHOU Xiao-ping . Acquisition method of specific motion frame based on human attitude estimation and clustering [J]. Journal of Graphics, 2022, 43(1): 44-52.

[1]	蔡兴泉, 霍宇晴, 李发建, 孙海燕. 面向太极拳学习的人体姿态估计及相似度计算[J]. 图学学报, 2022, 43(4): 695-706.
[2]	李忠伟, 徐斌, 李永, 宫凯旋, 刘格格. 基于非结构化三角网格的海洋流场可视化[J]. 图学学报, 2022, 43(3): 486-495.
[3]	李妮妮, 王夏黎, 付阳阳, 郑凤仙, 何丹丹, 袁绍欣. 一种优化 YOLO 模型的交通警察目标检测方法[J]. 图学学报, 2022, 43(2): 296-305.
[4]	刘玉杰, 张敏杰, 李宗民, 李华. 基于全局姿态感知的轻量级人体姿态估计[J]. 图学学报, 2022, 43(2): 333-341.
[5]	张豪远, 徐丹, 罗海妮, 杨冰. 基于边缘重建的多尺度壁画修复方法[J]. 图学学报, 2021, 42(4): 590-598.
[6]	任好盼, 王文明, 危德健, 高彦彦, 康智慧, 王全玉. 基于高分辨率网络的人体姿态估计方法[J]. 图学学报, 2021, 42(3): 432-438.
[7]	王春香, 刘流, 周国勇, 纪康辉. 面向自动修补的圆柱特征孔洞识别[J]. 图学学报, 2021, 42(3): 511-516.
[8]	罗国亮, 王贺, 赵昕, 曹义亲, 黄晓生, 邬昌兴, 冼楚华 . 基于数据结构化的三维动画压缩方法研究[J]. 图学学报, 2021, 42(2): 182-189.
[9]	冯洁 , 李博 , 周秉锋 , . 基于像素聚类的空间变化表面材质建模[J]. 图学学报, 2021, 42(1): 94-100.
[10]	唐科威, 穆梦娇, 李缙红, 张杰, 姜伟, 彭兴璇. 基于快速凸无穷范数极小化的大量子空间的子空间分割[J]. 图学学报, 2020, 41(6): 954-961.
[11]	李桂，李腾. 基于姿态引导的场景保留人物视频生成[J]. 图学学报, 2020, 41(4): 539-547.
[12]	李文生，原达，苗翠，王冬雨. 基于多标签层次聚类的GPR 图像双曲波提取方法[J]. 图学学报, 2020, 41(3): 399-408.
[13]	王万齐 1，马宝睿 2，李倩 2，卢文龙 1，刘玉身 2 . 基于属性相似性度量的 BIM 构件聚类[J]. 图学学报, 2020, 41(2): 304-312.
[14]	章蓉 1，陈谊 1，张梦录 1，孟可欣 2. 高维数据聚类可视分析方法综述[J]. 图学学报, 2020, 41(1): 44-56.
[15]	王美超 1，林丽 1，万露 1，高芸坤 2 . 基于感性语意模糊因子评价的图案设计源码特征集筛选[J]. 图学学报, 2019, 40(6): 1048-1056.

基于人体姿态估计与聚类的特定运动帧获取方法

Acquisition method of specific motion frame based on human attitude estimation and clustering

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价