Fully automatic matting algorithm for portraits based on deep learning

Journal of Graphics, 2022, Vol. 43, Issue (2): 247-253. DOI: 10.11996/JG.j.2095-302X.2022020247

• Image Processing and Computer Vision •

School of Science, Zhejiang University of Science and Technology, Hangzhou Zhejiang 310000, China

Online: 2022-04-30 Published: 2022-05-07
Supported by:
Natural Science Foundation of Zhejiang Province (Ly20A010005)

Abstract

Abstract: To address the low completeness of portrait mattes and the insufficiently refined edges seen in matting tasks, a fully automatic portrait matting algorithm based on deep learning is proposed. The algorithm learns with a three-branch network: the semantic segmentation branch (SSB) learns the semantic information of the alpha matte, the detail branch (DB) learns the detail information of the alpha matte, and the combination branch (COM) aggregates the results of the two branches. First, the encoder uses the lightweight convolutional neural network (CNN) MobileNetV2 to accelerate feature extraction. Second, an attention mechanism is added to the SSB to weight the importance of the image feature channels, and an atrous spatial pyramid pooling (ASPP) module is added to the DB to fuse, at multiple scales, the features extracted with different receptive fields. Then, the two decoder branches fuse the features extracted by the encoder at different stages through skip connections and decode them. Finally, the features learned by the two branches are fused to obtain the alpha matte of the image. Experimental results show that, on public datasets, the algorithm outperforms the deep-learning-based semi-automatic and fully automatic matting algorithms it was compared against, and that it outperforms MODNet on real-time streaming video matting.

Key words: fully automatic matting, lightweight convolutional neural network, attention mechanism, atrous spatial pyramid pooling, feature fusion
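
The abstract does not say which attention mechanism the SSB uses. As a rough illustration of what weighting the importance of feature channels can look like, the sketch below assumes a squeeze-and-excitation-style gate: each channel is averaged to one number, two small fully connected layers turn those numbers into a gate in (0, 1), and every channel is rescaled by its gate. The function name `channel_attention`, the weight matrices `w1` and `w2`, and the toy feature map are all hypothetical and are not taken from the paper.

```python
import math


def channel_attention(features, w1, w2):
    """Squeeze-and-excitation-style channel gating (illustrative assumption).

    features: list of C channels, each an H x W nested list of floats.
    w1: hidden x C weight matrix, w2: C x hidden weight matrix.
    Both stand in for learned parameters and are hypothetical here.
    """
    # Squeeze: global average pooling reduces each channel to one value.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in features
    ]
    # Excitation, step 1: fully connected layer followed by ReLU.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(weights, squeezed)))
        for weights in w1
    ]
    # Excitation, step 2: fully connected layer followed by a sigmoid,
    # giving one gate per channel in the open interval (0, 1).
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(weights, hidden))))
        for weights in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for gate, channel in zip(gates, features)
    ]
    return gates, scaled


# Toy input: two 2 x 2 channels with different average activations.
toy_features = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[3.0, 3.0], [3.0, 3.0]],
]
toy_w1 = [[-1.0, 1.0]]    # hidden size 1, made-up weights
toy_w2 = [[-1.0], [1.0]]  # one output gate per channel, made-up weights

gates, scaled = channel_attention(toy_features, toy_w1, toy_w2)
print([round(g, 3) for g in gates])
```

With these made-up weights the second channel receives the larger gate, so its features are preserved more strongly than those of the first channel. In a trained network the weights would be learned jointly with the rest of the model.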

CLC number: TP 391

Cite this article:

SU Chang-bao, GONG Shi-cai. Fully automatic matting algorithm for portraits based on deep learning[J]. Journal of Graphics, 2022, 43(2): 247-253.

