Lipi Gnani: A Versatile OCR for Documents in any Language Printed in Kannada Script.
H. R. Shiva Kumar, A. G. Ramakrishnan
Published in: ACM Trans. Asian Low Resour. Lang. Inf. Process. (2020)
Keyphrases
- Indian languages
- optical character recognition
- document images
- OCR systems
- scanned documents
- printed documents
- character n-grams
- word spotting
- scanned images
- Arabic documents
- language identification
- text lines
- document analysis
- character recognition
- text recognition
- printed text
- document image analysis
- cross lingual
- page layout
- document processing
- historical documents
- handwriting recognition
- document image retrieval
- programming language
- feature vectors
- preprocessing
- document retrieval
- document collections