
Journal of Graphics


A Fast Correcting Method for Warped Chinese and English Mixed Document Images

  

  • Online: 2015-12-31 Published: 2016-01-15

Abstract: OCR achieves a low character recognition rate on warped document images that mix Chinese
and English text. To address this problem, this paper proposes a fast distortion correction method.
After image preprocessing, the upper and lower boundaries of each text line are obtained by
morphological dilation. The characters in each line are then segmented one by one using vertical
projection information, and each character is described by its minimum bounding box. Next, the
positions of the segmented characters in each line are corrected according to the different
structural characteristics of Chinese and English characters. Finally, the image is reconstructed.
Experiments show that the method rectifies warped Chinese and English document images quickly and
effectively, and that the OCR recognition rate on the corrected images improves significantly.
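
The character segmentation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single text line has already been cropped and binarized into a NumPy boolean array (True for ink), and the function names `column_runs` and `segment_characters` are hypothetical. It splits the line wherever the vertical projection drops to zero and then shrinks each piece to its minimum bounding box.

```python
import numpy as np

def column_runs(has_ink):
    """Return (start, end) pairs, end exclusive, for each run of True in a 1-D mask."""
    padded = np.concatenate(([False], has_ink, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    return list(zip(edges[0::2].tolist(), edges[1::2].tolist()))

def segment_characters(line):
    """Split one binarized text line (2-D bool array, True = ink) into boxes.

    Returns a list of (top, bottom, left, right) tuples with exclusive
    bottom/right bounds, one minimum bounding box per segment.
    """
    projection = line.sum(axis=0)            # vertical projection: ink count per column
    boxes = []
    for left, right in column_runs(projection > 0):
        rows = np.flatnonzero(line[:, left:right].any(axis=1))
        boxes.append((int(rows[0]), int(rows[-1]) + 1, left, right))
    return boxes

# Toy line image containing two separated blobs of ink.
line = np.zeros((5, 8), dtype=bool)
line[1:4, 1:3] = True
line[2:5, 5:7] = True
print(segment_characters(line))  # [(1, 4, 1, 3), (2, 5, 5, 7)]
```

A plain projection split like this can break a Chinese character made of horizontally separated components into several boxes. The sketch does not attempt the structure-based handling of Chinese and English characters that the abstract mentions, nor the dilation-based text line extraction.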

Key words: mixture of Chinese and English, warped document images, text line extraction, character
segmentation