基于图文混排的传统服饰图像以文标图算法

doi:10.11996/JG.j.2095-302X.2021030398

图学学报 ›› 2021, Vol. 42 ›› Issue (3): 398-405.DOI: 10.11996/JG.j.2095-302X.2021030398

• 图像处理与计算机视觉 • 上一篇下一篇

基于图文混排的传统服饰图像以文标图算法

1. 北京邮电大学人工智能学院，北京 100876； 2. 北京邮电大学数字媒体与设计艺术学院，北京 100876

出版日期:2021-06-30 发布日期:2021-06-29
基金资助:
北京邮电大学基本科研业务费科研项目(2020RC26)

A method of automatic image annotation for image-text mixed domain books

1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China; 2. School of Digital Media and Design Arts, Beijing University of Posts and Telecommunications, Beijing 100876, China

Online:2021-06-30 Published:2021-06-29
Supported by:
Basic Scientific Research Funds of Beijing University of Posts and Telecommunications (2020RC26)

摘要/Abstract

摘要： 针对高效解读和智能处理海量图文资料是一项极具挑战并具有实用价值工作，而自动标注精度又面临依赖训练样本的难题，提出了一种基于数字图文混排书籍以文标图方法，由混排版式识别预处理、领域图像语义标签构建和大标签空间以文标图算法 3 部分组成。首先，通过提出的混排版式识别离算法，提取数字图文混排版式中图像、标题及描述文本等内容。然后，基于数字服饰图像语义标签，建立传统文化领域词库 (PatternNet)，最后针对领域词库标签空间特点，提出一种改进大标签空间的以文标图算法，并在服饰类图文混排书籍上进行仿真实验，通过对比其他数据集，验证了该算法的实效性。

关键词: 以文标图, 图像标注, 图文混排处理, 领域关键词提取

Abstract: Efficient interpretation and intelligent processing of massive text and text data is a very challenging and practical work, but the accuracy of automatic labeling is highly dependent on the quality and quantity of training samples. In this paper, an image annotation method of images and text data mixed information is proposed. The method consists of three parts: adaptive image and text separation preprocessing, domain image semantic label construction and text-based image annotation algorithm. Firstly, the proposed hybrid layout recognition algorithm is used to extract the image, title and description text in the hybrid layout of images and text data. Then, the Traditional Cultural Domain Lexicon (PatternNet) is established based on semantic tags of digital clothing image. Finally, according to the characteristics of domain lexicon's tag space, a text-based image annotation algorithm is proposed to improve the large tag space. The simulation experiment is carried out on the ethnic costumes books that images and text data hybrid layout, also compared with other data sets. The experimental results verify the effectiveness of the algorithm proposed in this paper.

Key words: , annotation image with text, PatternNet, digital image-text processing, domain keyword extraction

中图分类号:

TP 391

赵海英 , 高子惠 , 邓恋 , 侯小刚 , 李宁 . 基于图文混排的传统服饰图像以文标图算法[J]. 图学学报, 2021, 42(3): 398-405.

ZHAO Hai-ying , GAO Zi-hui , DENG Lian , HOU Xiao-gang , LI Ning. A method of automatic image annotation for image-text mixed domain books[J]. Journal of Graphics, 2021, 42(3): 398-405.

[1]	于宁 1，宋海玉 1，孙东洋 2，王鹏杰 1，姚金鑫 1 . 基于深度学习中间层卷积特征的图像标注[J]. 图学学报, 2019, 40(5): 872-877.
[2]	周家琪，刘丽，崔晓萍，尚菲，矫文聪. 基于空间信息的残缺图像标注[J]. 图学学报, 2017, 38(增刊): 15-19.

基于图文混排的传统服饰图像以文标图算法

A method of automatic image annotation for image-text mixed domain books

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 2

编辑推荐

Metrics

本文评价