Paper
18 December 2001 Translation lexicon acquisition from bilingual dictionaries
David Scott Doermann, Huanfeng Ma, Burcu Karagol-Ayan, Douglas William Oard
Author Affiliations +
Proceedings Volume 4670, Document Recognition and Retrieval IX; (2001) https://doi.org/10.1117/12.450737
Event: Electronic Imaging, 2002, San Jose, California, United States
Abstract
Bilingual dictionaries hold great potential as a source of lexical resources for training automated systems for optical character recognition, machine translation and cross-language information retrieval. In this work we describe a system for extracting term lexicons from printed copies of bilingual dictionaries. We describe our approach to page and definition segmentation and entry parsing. We have used the approach to parse a number of dictionaries and demonstrate the results for retrieval using a French-English Dictionary to generate a translation lexicon and a corpus of English queries applied to French documents to evaluation cross-language IR.
© (2001) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
David Scott Doermann, Huanfeng Ma, Burcu Karagol-Ayan, and Douglas William Oard "Translation lexicon acquisition from bilingual dictionaries", Proc. SPIE 4670, Document Recognition and Retrieval IX, (18 December 2001); https://doi.org/10.1117/12.450737
Lens.org Logo
CITATIONS
Cited by 8 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Associative arrays

Optical character recognition

Image segmentation

Image processing

Feature extraction

Error analysis

Stochastic processes

RELATED CONTENT

Bootstrapping structured page segmentation
Proceedings of SPIE (January 13 2003)
Author name recognition in degraded journal images
Proceedings of SPIE (January 16 2006)
Multiple-agent adaptation in whole-book recognition
Proceedings of SPIE (January 24 2011)
Applications of NOAA/AVHRR data on grassland monitoring
Proceedings of SPIE (August 19 1998)
Adaptive color document image binarization for text retrieval
Proceedings of SPIE (December 15 2003)

Back to Top