Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2022, Vol. 43 ›› Issue (3): 387-395.DOI: 10.11996/JG.j.2095-302X.2022030387

• Image Processing and Computer Vision • Previous Articles     Next Articles

Offline handwriting mathematical symbol recognition based on improved YOLOv5s

  

  1. 1. College of Big Data and Information Engineering, Guizhou University, Guiyang Guizhou 550025, China;
    2. Semiconductor Power Device Reliability Engineering Center of Ministry of Education, Guiyang Guizhou 550025, China;
    3. Western Modernization Research Center, Guizhou University of Finance and Economics, Guiyang Guizhou 550025, China;
    4. College of Computer Science and Technology, Guizhou University, Guiyang Guizhou 550025, China
  • Online:2022-06-30 Published:2022-06-28
  • Supported by:
     National Natural Science Foundation of China (61562009); National Key Research and Development Program of China
    (2016YFD0201305-07); Guizhou University Introduced Talent Research Project (2015-29); Open Fund Project in Semiconductor
    Power Device Reliability Engineering Center of Ministry of Education (ERCMEKFJJ2019-(06))

Abstract:

Offline mathematical symbol recognition is the premise of offline mathematical expression recognition. The existing offline symbol recognition methods can only recognize symbols, but is of no help to other steps of offline expression recognition, even restricting expression recognition. Thus, an improved YOLOv5s offline symbol recognition method was proposed. Firstly, considering the small size of symbolic image, generative adversarial network (GAN) was employed to enhance the data. Secondly, from the point of view of symbolic categories, the spatial attention mechanism was introduced to YOLOv5s model, and the global maximum and global mean were pooled to enlarge the differences between categories. Finally, from the point of view of the symbol itself, the bidirectional long-short-term memory network (BiLSTM) was utilized to process the symbol feature matrix, so that the symbol feature could possess the upper and lower related information. Experimental results show that the improved YOLOv5s achieves better offline symbol recognition, with a recognition rate of 92.47%. Compared with other methods, the proposed method is effective and robust. At the same time, it can effectively avoid the problem of error accumulation in offline mathematical expression recognition and provide an effective basis for expression structure analysis.

Key words: offline handwriting mathematical symbol recognition, data enhancement, generative adversarial network;
spatial attention mechanism,
bidirectional long-short-term memory network

CLC Number: