
Journal of Graphics ›› 2021, Vol. 42 ›› Issue (3): 398-405. DOI: 10.11996/JG.j.2095-302X.2021030398

• Image Processing and Computer Vision •

Text-based image annotation algorithm for traditional costume images in mixed image-text layouts

  

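  1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876; 2. School of Digital Media and Design Arts, Beijing University of Posts and Telecommunications, Beijing 100876
  • Publication date: 2021-06-30 Release date: 2021-06-29
  • Funding:
    Basic Scientific Research Funds of Beijing University of Posts and Telecommunications (2020RC26)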

A method of automatic image annotation for image-text mixed domain books

  1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China; 2. School of Digital Media and Design Arts, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Online: 2021-06-30 Published: 2021-06-29
  • Supported by:
    Basic Scientific Research Funds of Beijing University of Posts and Telecommunications (2020RC26)

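Abstract: Efficiently interpreting and intelligently processing massive image-text materials is a highly challenging task of practical value, while the accuracy of automatic annotation depends on training samples. To address this, a text-based image annotation method for digital books with mixed image-text layouts is proposed. It consists of three parts: mixed-layout recognition preprocessing, construction of domain image semantic labels, and a text-based image annotation algorithm for large label spaces. First, the proposed mixed-layout recognition and separation algorithm extracts the images, titles, and descriptive text from the digital mixed image-text layout. Then, a traditional-culture domain lexicon (PatternNet) is built from the semantic labels of digital costume images. Finally, in view of the characteristics of the lexicon's label space, an improved text-based image annotation algorithm for large label spaces is proposed. Simulation experiments on costume books with mixed image-text layouts, together with comparisons against other datasets, verify the effectiveness of the algorithm.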

Key words: text-based image annotation, image annotation, mixed image-text layout processing, domain keyword extraction

Abstract: Efficient interpretation and intelligent processing of massive image and text data is a challenging task of practical value, but the accuracy of automatic labeling is highly dependent on the quality and quantity of training samples. In this paper, a text-based image annotation method for digital books with mixed image-text layouts is proposed. The method consists of three parts: adaptive image and text separation preprocessing, domain image semantic label construction, and a text-based image annotation algorithm for large tag spaces. First, the proposed hybrid layout recognition algorithm is used to extract the images, titles, and description text from the hybrid image-text layout. Then, the Traditional Cultural Domain Lexicon (PatternNet) is established based on the semantic tags of digital clothing images. Finally, according to the characteristics of the domain lexicon's tag space, an improved text-based image annotation algorithm for large tag spaces is proposed. Simulation experiments are carried out on ethnic costume books with hybrid image-text layouts, and the results are compared with those on other datasets. The experimental results verify the effectiveness of the proposed algorithm.
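The idea of annotating an image from its accompanying text can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper, whose details the abstract does not give. It only assumes that each image has already been paired with a title and a description, and that a domain lexicon is available as a list of terms. The function name `annotate_image`, the `title_weight` parameter, and the sample lexicon entries are all hypothetical.

```python
from collections import Counter


def annotate_image(title, description, lexicon, top_k=3, title_weight=2.0):
    """Rank lexicon terms by weighted occurrence in an image's title and description.

    Hypothetical illustration only: a plain substring count, with title
    matches weighted more heavily than description matches.
    """
    title_l = title.lower()
    desc_l = description.lower()
    scores = Counter()
    for term in lexicon:
        t = term.lower()
        score = title_weight * title_l.count(t) + desc_l.count(t)
        if score > 0:
            scores[term] = score
    # Sort by descending score, then alphabetically so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_k]]


# Made-up lexicon and text, standing in for PatternNet entries and book captions.
lexicon = ["dragon pattern", "phoenix pattern", "cloud pattern", "embroidery"]
labels = annotate_image(
    title="Robe with dragon pattern",
    description="Silk robe; the dragon pattern is surrounded by cloud pattern embroidery.",
    lexicon=lexicon,
)
print(labels)  # ['dragon pattern', 'cloud pattern', 'embroidery']
```

Weighting title matches above description matches is one simple design choice. With a large tag space, a real system would need a more selective scoring scheme than raw counts.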

Key words: annotation image with text, PatternNet, digital image-text processing, domain keyword extraction
