欢迎访问《图学学报》 分享到:

图学学报 ›› 2023, Vol. 44 ›› Issue (1): 59-66.DOI: 10.11996/JG.j.2095-302X.2023010059

• 图像处理与计算机视觉 • 上一篇    下一篇

基于注意力机制的东巴画情感分类

潘森垒1(), 钱文华1(), 曹进德2, 徐丹1   

  1. 1.云南大学信息学院计算机科学与工程系,云南 昆明 650504
    2.东南大学数学学院系统科学系,江苏 南京 211189
  • 收稿日期:2022-06-19 修回日期:2022-07-11 出版日期:2023-10-31 发布日期:2023-02-16
  • 通讯作者: 钱文华
  • 作者简介:潘森垒(1996-),男,硕士研究生。主要研究方向为数字图像处理与计算机视觉。E-mail:313000234@qq.com
  • 基金资助:
    国家自然科学基金项目(62162065);云南省科技厅应用基础研究计划重点项目(2019FA044);云南省中青年学术技术带头人后备人才项目(2019HB121);云南大学研究生科研创新项目(2021Z078)

Learning attention for Dongba paintings emotion classification

PAN Sen-lei1(), QIAN Wen-hua1(), CAO Jin-de2, XU Dan1   

  1. 1. Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming Yunan 650504, China
    2. Department of System Science, School of Mathematics, Southeast University, Nanjing Jiangsu 211189, China
  • Received:2022-06-19 Revised:2022-07-11 Online:2023-10-31 Published:2023-02-16
  • Contact: QIAN Wen-hua
  • About author:PAN Sen-lei (1996-), master student. His main research interests cover digital image processing and computer vision. E-mail:313000234@qq.com
  • Supported by:
    National Natural Science Foundation of China(62162065);Key Project of Applied Basic Research Plan of Yunnan Provincial Department of Science and Technology(2019FA044);Yunnan Young and Middle-Aged Academic and Technical Leaders Reserve Talents Project(2019HB121);Postgraduate Research and Innovation Foundation of Yunnan University(2021Z078)

摘要:

东巴画情感多元、样本较少,基于注意力机制的分类算法能有效辅助东巴画情感分类,解决东巴画样本较少的问题。首先,按照绘画主题,将东巴画分为人物、鬼怪、动物、植物4类题材,按照绘画情感分为勤劳朴实、曼妙美好等12种情感;其次,采用编码器-解码器架构实现东巴画的情感特征提取,引入预训练模型提升分类模型的泛化性能,加速小样本东巴画情感分类收敛;最后,在解码器中设置空白注意力并融合编码器的输出序列,经由解码器的解码学习机制,学习东巴画图像语义以指导模型更准确、合理地提升对东巴画情感的学习能力。实验结果表明,基于注意力机制的分类算法相较现有方法分类准确率更高,获得了80.7%的分类准确率,解决了东巴画情感丰富、难以区分的问题。

关键词: 东巴画, 注意力机制, 情感分类, 编码器

Abstract:

Rich in emotions and limited in samples constitute the artistic characteristic of Dongba paintings. The classification algorithm of learning attention could effectively assist the emotional classification of Dongba paintings, solving the problem of limited samples of Dongba paintings. Firstly, Dongba paintings were divided into 4 themes such as figures, ghosts, animals, and plants. According to the painting emotions, Dongba paintings were divided into 12 kinds of emotions, such as industriousness, simplicity, grace, and beauty. Secondly, the architecture of encoder and decoder was employed to extract their emotional features, while the pre-training model was used to improve the generalization performance of the classification model and accelerate the convergence of emotion classification for small samples of Dongba paintings. Finally, the blank attention was set in the decoder and the output sequence of the encoder was fused. Through the decoder, the semantics of Dongba paintings were learned, guiding the model to more accurately and reasonably improve the classification ability. Experiments show that the classification algorithm of learning attention could attain a classification accuracy of 80.7% higher than the existing methods, solving the problem of rich and difficult-to-distinguish emotions in Dongba paintings.

Key words: Dongba paintings, attention, emotion classification, encoder

中图分类号: