Welcome to Journal of Graphics share: 

Journal of Graphics

Previous Articles     Next Articles

Fine-Grained Image Classification Based on Text and Visual Information

  

  1. (1. Beijing Forever Technology Co. Ltd, Beijing 100011, China;
    2. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China)
  • Online:2019-06-30 Published:2019-08-02

Abstract: The fine-grained image classification generally only focuses on the partial visual information of image, but in some problems the text information of partial image has a direct relationship with the classification result. By extracting the semantic information of the image text, the image classification effect can be further improved. We comprehensively consider the visual information and local text information of image, and then propose an end-to-end classification model to solve the problem of fine-grained image classification. On the one hand, the deep convolutional neural network is used to obtain the visual features of the image, on the other hand, according to the proposed end-to-end text recognition network, the text information of the image is extracted, and then the visual feature and the text feature are merged by the correlation calculation module and sent to the classification network. Finally, we test the results of our method in the image classification on the public dataset Con-Text, and also verify the end-to-end text recognition network on the SVT dataset, which is better than the previous method.

Key words:  computer vision, fine-grained image classification, scene text recognition, convolution neural network, attention mechanism