Transformer-Based Neural Machine Translation for Post-OCR Error Correction in Cursive Text.
Nehal Yasin, Imran Siddiqi, Momina Moetesum, Sadaf Abdul-Rauf
Published in: ICDAR Workshops (2) (2023)
Keyphrases
- error correction
- machine translation
- natural language generation
- machine translation system
- word level
- cross lingual
- language independent
- language processing
- target language
- natural language processing
- multilingual documents
- Chinese-English
- statistical machine translation
- character recognition
- source language
- word recognition
- word alignment
- text retrieval
- word sense disambiguation
- language resources
- error detection
- information extraction
- cross language information retrieval
- document analysis
- natural language
- keywords
- sentence level
- text mining
- parallel corpus
- Brazilian Portuguese
- optical character recognition
- text documents
- information retrieval
- watermarking scheme
- data mining