Paper
10 September 2005 Extraction of character areas from digital camera based color document images and OCR system
Y. K. Chung, S. Y. Chi, K. S. Bae, K. K. Kim, D. Jang, K. C. Kim, Y. W. Choi
Author Affiliations +
Abstract
When document images are obtained from digital cameras, many imaging problems have to be solved for better extraction of characters from the images. Variation of illumination intensity sensitively affects to color values. A simple colored document image could be converted to a monochrome image by a traditional method and then a binarization algorithm is used. But this method is not stably working to the variation of illumination because sensitivity of colors to variation of illumination. For narrowly distributed colors, the conversion is not working well. Secondly, in case that the number of colors is more than two, it is not easy to figure out which color is for character and which others are for background. This paper discusses about an extraction method from a colored document image using a color process algorithm based on characteristics of color features. Variation of intensities and color distribution are used to classify character areas and background areas. A document image is segmented into several color groups and similar color groups are merged. In final step, only two colored groups are left for the character and background. The extracted character areas from the document images are entered into optical character recognition system. This method solves a color problem, which comes from traditional scanner based OCR systems. This paper also describes the OCR system for character conversion of a colored document image. Our method is working for the colored document images of cellular phones and digital cameras in real world.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Y. K. Chung, S. Y. Chi, K. S. Bae, K. K. Kim, D. Jang, K. C. Kim, and Y. W. Choi "Extraction of character areas from digital camera based color document images and OCR system", Proc. SPIE 5908, Optical Information Systems III, 59080Y (10 September 2005); https://doi.org/10.1117/12.614174
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Feature extraction

Image segmentation

Digital cameras

Image processing

Cameras

Detection and tracking algorithms

Back to Top