Journal of Graphics
Previous Articles Next Articles
Online:
Published:
Abstract: Character recognition rate of OCR processing is not well for warped Chinese and English document image. To resolve this problem, a fast distortion correcting method is proposed in this paper. After the process of image preprocessing, the upper and lower boundary of each text line could be obtained by morphological dilation method. Then, the characters in each line are segmented one by one based on the vertical projection information. Every character can be described in a minimum bounding box. After that, the positions of the segmented characters are corrected according to the different structure characteristics between Chinese and English in each line. Finally, the image could be reconstructed. Experiments showed that this correction method could rectify the warped Chinese and English document image quickly and effectively. The OCR rate of the corrected images could be significantly improved.
Key words: mixture of chinese and english, warped document images, text line extraction, character segmentation
Wang Jingzhong, Sun Ting, Tong Lijing. A Fast Correcting Method for Warped Chinese and English#br# Mixed Document Images[J]. Journal of Graphics, DOI: 10.11996/JG.j.2095-302X.2015060920.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2015060920
http://www.txxb.com.cn/EN/Y2015/V36/I6/920