属性语义与图谱语义融合增强的
零次学习图像识别

doi:10.11996/JG.j.2095-302X.2021060899

图学学报 ›› 2021, Vol. 42 ›› Issue (6): 899-907.DOI: 10.11996/JG.j.2095-302X.2021060899

• 图像处理与计算机视觉 • 上一篇下一篇

属性语义与图谱语义融合增强的零次学习图像识别

云南大学软件学院，云南昆明 650500

出版日期:2022-01-18 发布日期:2022-01-18
基金资助:
中国科协“青年人才托举工程”项目(W8193209)；云南省科技厅项目(202001BB050035)

Attribute and graph semantic reinforcement based zero-shot learning for image recognition

School of Software, Yunnan University, Kunming Yunnan 650500, China

Online:2022-01-18 Published:2022-01-18
Supported by:
China Association for Science and Technology “Youths Talents Support Project” (W8193209); Technology Department Program of Yunnan Province (202001BB050035)

摘要/Abstract

摘要： 零次学习(ZSL)是迁移学习在图像识别领域一个重要的分支。其主要的学习方法是在不使用未见类的情况下，通过训练可见类语义属性和视觉属性映射关系来对未见类样本进行识别，是当前图像识别领域的热点。现有的 ZSL 模型存在语义属性和视觉属性的信息不对称，语义信息不能很好地描述视觉信息，从而出现了领域漂移问题。未见类语义属性到视觉属性合成过程中部分视觉特征信息未被合成，影响了识别准确率。为了解决未见类语义特征缺失和未见类视觉特征匹配合成问题，本文设计了属性语义与图谱语义融合增强的 ZSL 模型实现 ZSL 效果的提升。该模型学习过程中使用知识图谱关联视觉特征，同时考虑样本之间的属性联系，对可见类样本和未见类样本语义信息进行了增强，采用对抗式的学习过程加强视觉特征的合成。该方法在 4 个典型的数据集上实验表现出了较好的实验效果，模型也可以合成较为细致的视觉特征，优于目前已有的 ZSL 方法。

关键词: 零次学习, 知识图谱, 生成对抗网络, 图卷积神经网络, 图像识别

Abstract: Zero-shot learning (ZSL) is an important branch of transfer learning in the field of image recognition. The main learning method is to train the mapping relationship between the semantic attributes of the visible category and the visual attributes without using the unseen category, and use this mapping relationship to identify the unseen category samples, which is a hot spot in the current image recognition field. For the existing ZSL model, there remains the information asymmetry between the semantic attributes and the visual attributes, and the semantic information cannot well describe visual information, leading to the problem of domain shift. In the process of synthesizing unseen semantic attributes into visual attributes, part of the visual feature information was not synthesized, which affected the recognition accuracy. In order to solve the problem of the lack of unseen semantic features and synthesis of unseen visual features, this paper designed a ZSL model that combined attribute and graph semantic to improve the zero-shot learning’s accuracy. In the learning process of the model, the knowledge graph was employed to associate visual features, while considering the attribute connection among samples, the semantic information of the seen and unseen samples was enhanced, and the adversarial learning process was utilized to strengthen the synthesis of visual features. The method shows good experimental results through experiments on four typical data sets, and the model can synthesize more detailed visual features, and its performance is superior to the existing ZSL methods.

Key words: , zero-shot learning, knowledge graph, generative adversarial networks, graph convolution, image recognition

中图分类号:

TP 391

汪玉金, 谢诚, 余蓓蓓, 向鸿鑫, 柳青. 属性语义与图谱语义融合增强的零次学习图像识别[J]. 图学学报, 2021, 42(6): 899-907.

WANG Yu-jin, XIE Cheng, YU Bei-bei, XIANG Hong-xin, LIU Qing . Attribute and graph semantic reinforcement based zero-shot learning for image recognition[J]. Journal of Graphics, 2021, 42(6): 899-907.

[1]	廖仕敏, 刘仰川, 朱叶晨, 王艳玲, 高欣 . 一种基于 CycleGAN 改进的低剂量 CT 图像增强网络[J]. 图学学报, 2022, 43(4): 570-578.
[2]	方洪波, 万广, 陈忠辉, 黄以卫, 张文勇, 谢本亮. 基于改进 YOLOv5s 的离线手写数学符号识别[J]. 图学学报, 2022, 43(3): 387-395.
[3]	林森 , 刘旭 . 门控融合对抗网络的水下图像增强 [J]. 图学学报, 2021, 42(6): 948-956.
[4]	宋建炜 , 邓逸川 , 苏成 , . 基于预训练语言模型的建筑施工安全事故文本的命名实体识别研究[J]. 图学学报, 2021, 42(2): 307-315.
[5]	杨勇，刘惠义. 极端低光情况下的图像增强方法[J]. 图学学报, 2020, 41(4): 520-528.
[6]	李桂，李腾. 基于姿态引导的场景保留人物视频生成[J]. 图学学报, 2020, 41(4): 539-547.
[7]	罗琪彬 1,2，蔡强 1,2 . 采用双框架生成对抗网络的图像运动模糊盲去除[J]. 图学学报, 2019, 40(6): 1056-1063.
[8]	杨世强，乔丹，弓逯琦，李小莉，李德信 . 基于 Laplace 逼近 Gaussian 过程的指节图像中层偏移测度特征学习[J]. 图学学报, 2019, 40(3): 574-582.
[9]	陈宁，王胜，黄正文. 基于特征匹配的集装箱识别与定位技术研究[J]. 图学学报, 2016, 37(4): 530-536.
[10]	邓学雄，杨志成，朱正海. 商标检索中形状特征描述的研究[J]. 图学学报, 2011, 32(6): 21-24.

属性语义与图谱语义融合增强的零次学习图像识别

Attribute and graph semantic reinforcement based zero-shot learning for image recognition

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 10

编辑推荐

Metrics

本文评价

属性语义与图谱语义融合增强的 零次学习图像识别

Attribute and graph semantic reinforcement based zero-shot learning for image recognition

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 10

编辑推荐

Metrics

本文评价

属性语义与图谱语义融合增强的零次学习图像识别