Integrating Optical Character Recognition and Machine Translation of Historical Documents.
Haithem AfliAndy WayPublished in: LT4DH@COLING (2016)
Keyphrases
- machine translation
- optical character recognition
- historical documents
- handwriting recognition
- document images
- historical manuscripts
- handwritten document images
- character recognition
- natural language processing
- word recognition
- cross lingual
- word level
- target language
- language independent
- information extraction
- printed documents
- word spotting
- natural language
- document analysis
- cross language information retrieval
- document image analysis
- handwritten text
- statistical machine translation
- handwritten documents
- machine vision
- document image retrieval