基于姿态引导的场景保留人物视频生成

doi:10.11996/JG.j.2095-302X.2020040539

图学学报

• 图像处理与计算机视觉 • 上一篇下一篇

基于姿态引导的场景保留人物视频生成

(安徽大学电气工程与自动化学院，安徽合肥 230601)

出版日期:2020-08-31 发布日期:2020-08-22
基金资助:
国家自然科学基金项目(61572029)；安徽省杰出青年基金项目(1908085J25)

Pose-guided scene-preserving person video generation algorithm

(School of Electrical Engineering and Automation, Anhui University, Hefei Anhui 230601, China)

Online:2020-08-31 Published:2020-08-22
Supported by:
National Natural Science Foundation of China (61572029); Anhui Outstanding Youth Fund (1908085J25)

摘要/Abstract

摘要： 人物视频生成技术是通过学习人体结构与运动的特征表示，实现从特征表示到
人物视频帧的空间生成映射。针对现有的人物视频生成算法未考虑背景环境转换及人体姿态
估计精度较低等问题，提出一种基于姿态引导的场景保留人物视频生成算法(PSPVG)。首先，
取合适的源视频和目标视频，利用分割人物外观的视频帧代替源视频帧作为网络的输入；然
后，基于GAN 的运动转换模型将源视频中的人物替换成目标人物，并保持动作一致性；最后，
引用泊松图像编辑将人物外观与源背景融合，去除边界异常像素，实现将人物自然地融入源
场景且避免改变画面背景环境和整体风格。该算法使用分割出的前景人物图代替源视频帧中
的人物，减少背景干扰，提高姿态估计精度，自然地实现运动转移过程中源场景的保留，生
成艺术性与真实性和谐并存的人物视频。

关键词: 人物视频生成, 姿态估计, 运动转换, 生成对抗网络, 图像处理

Abstract: The person video generation technology learns the feature representation of human body
structure and motion, so as to realize the spatial generation mapping from the feature representation to
the character video frame. In view of the existing person video generation algorithm lacking in the
transformation of background environment and the low accuracy of human pose estimation, a
pose-guided scene-preserving person video generation algorithm was proposed. First, the appropriate
source video and target video were selected, and the video frame with the appearance of the
segmented character served as the network input instead of the source video frame. Then, based on
GAN, a motion transformation model was employed to replace characters in source videos with target
characters and maintain the consistency of motion. Finally, the Poisson image editing was used to
fuse the character appearance with the source background, enabling the flowed advantages: (a)
removing border anomaly pixels; (b) realizing character blending naturally into the source scene; and
(c) avoiding changing the background environment and overall image style. The proposed algorithm
used the segmented foreground person image instead of the source video frame to reduce background
interference and improve the accuracy of pose estimation, thus naturally realizing scene-preserving
during the motion transfer process and producing artistic and authentic person videos.

Key words: person video generation, pose estimation, motion transfer, generative adversarial
networks, image processing

李桂，李腾. 基于姿态引导的场景保留人物视频生成[J]. 图学学报, DOI: 10.11996/JG.j.2095-302X.2020040539.

LI Gui, LI Teng. Pose-guided scene-preserving person video generation algorithm[J]. Journal of Graphics, DOI: 10.11996/JG.j.2095-302X.2020040539.

[1]	东辉, 陈鑫凯, 孙浩, 姚立纲. 基于改进 YOLOv4 和图像处理的蔬菜田杂草检测[J]. 图学学报, 2022, 43(4): 559-569.
[2]	廖仕敏, 刘仰川, 朱叶晨, 王艳玲, 高欣 . 一种基于 CycleGAN 改进的低剂量 CT 图像增强网络[J]. 图学学报, 2022, 43(4): 570-578.
[3]	蔡兴泉, 霍宇晴, 李发建, 孙海燕. 面向太极拳学习的人体姿态估计及相似度计算[J]. 图学学报, 2022, 43(4): 695-706.
[4]	方洪波, 万广, 陈忠辉, 黄以卫, 张文勇, 谢本亮. 基于改进 YOLOv5s 的离线手写数学符号识别[J]. 图学学报, 2022, 43(3): 387-395.
[5]	刘玉杰, 张敏杰, 李宗民, 李华. 基于全局姿态感知的轻量级人体姿态估计[J]. 图学学报, 2022, 43(2): 333-341.
[6]	蔡敏敏, 黄继风, 林晓, 周小平. 基于人体姿态估计与聚类的特定运动帧获取方法[J]. 图学学报, 2022, 43(1): 44-52.
[7]	汪玉金, 谢诚, 余蓓蓓, 向鸿鑫, 柳青. 属性语义与图谱语义融合增强的零次学习图像识别[J]. 图学学报, 2021, 42(6): 899-907.
[8]	林森 , 刘旭 . 门控融合对抗网络的水下图像增强 [J]. 图学学报, 2021, 42(6): 948-956.
[9]	满开亮, 汪友生, 刘继荣. 基于稠密残差网络的图像超分辨率重建算法[J]. 图学学报, 2021, 42(4): 556-562.
[10]	任好盼, 王文明, 危德健, 高彦彦, 康智慧, 王全玉. 基于高分辨率网络的人体姿态估计方法[J]. 图学学报, 2021, 42(3): 432-438.
[11]	王道累, 张天宇. 图像去雾算法的综述及分析[J]. 图学学报, 2020, 41(6): 861-870.
[12]	崔文超, 邹俊杰, 汪方毅, 唐庭龙, 夏平. OBE 理念下项目驱动的数字图像处理教学研究 [J]. 图学学报, 2020, 41(6): 1031-1038.
[13]	吴泽斌 1,张东亮 1,李基拓 2,麻菁 1,信玉峰 3. 复杂场景下的人体轮廓提取及尺寸测量[J]. 图学学报, 2020, 41(5): 740-749.
[14]	杨勇，刘惠义. 极端低光情况下的图像增强方法[J]. 图学学报, 2020, 41(4): 520-528.
[15]	罗琪彬 1,2，蔡强 1,2 . 采用双框架生成对抗网络的图像运动模糊盲去除[J]. 图学学报, 2019, 40(6): 1056-1063.

基于姿态引导的场景保留人物视频生成

Pose-guided scene-preserving person video generation algorithm

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价