A Bilingual OCR for Hindi-Telugu Documents and its Applications.
C. V. Jawahar, M. N. S. S. K. Pavan Kumar, S. S. Ravi Kiran
Published in: ICDAR (2003)
Keyphrases
- Indian languages
- document images
- optical character recognition
- OCR systems
- printed documents
- cross lingual
- language identification
- document processing
- document analysis
- page layout
- scanned documents
- document image analysis
- text lines
- character recognition
- parallel corpora
- comparable corpora
- machine translation
- word level
- spoken language
- scanned images
- web documents