Paper
15 March 2024 Research and implementation of business English translation software based on image character recognition
Zheng Wei
Author Affiliations +
Proceedings Volume 13075, Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023); 130751U (2024) https://doi.org/10.1117/12.3026789
Event: Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023), 2023, Kunming, China
Abstract
Optical Character Recognition (OCR) is a technology that integrates optical and computer technologies to convert printed text into machine-readable text. The process begins with the conversion of printed characters into a binary (black and white) image, achieved by detecting the light and dark patterns of each pixel in the document. Subsequently, OCR software applies recognition algorithms to this binary image to identify and convert the characters into digital text. This digital text is then formatted into a computer-readable format, enabling further processing and editing. This technology is instrumental in digitizing printed documents, automating data entry processes, and enabling text searches in scanned documents. In the daily information office, it is often necessary to recognize valid information in text images. This paper proposes a thresholding method for text optical character information translation recognition based on OCR technology, and uses a text image thresholding image processing method based on histogram analysis and the OTSU algorithm to segment the optical character information in text images. In the segmented foreground, a thresholding optical character information translation classifier based on OCR technology is used to realize the thresholding text optical character information translation recognition. It is verified that the proposed method can achieve high accuracy and efficiency in thresholding text optical character information translation recognition, and the recognition performance of the method is remarkable compared with other recognition methods.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Zheng Wei "Research and implementation of business English translation software based on image character recognition", Proc. SPIE 13075, Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023), 130751U (15 March 2024); https://doi.org/10.1117/12.3026789
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Image processing

Histograms

Binary data

Detection and tracking algorithms

Image visualization

Visualization

Back to Top