Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine
Sandip Rakshit, Subhadip Basu
Published in: CoRR (2010)
Keyphrases
- optical character recognition
- character recognition
- character segmentation
- open source
- handwriting recognition
- word spotting
- document images
- indian languages
- handwritten characters
- document analysis
- ocr systems
- document image analysis
- hand written
- printed documents
- machine vision
- text lines
- text recognition
- word recognition
- handwritten document images
- handwritten documents
- numeral strings
- open source software
- automatic recognition
- recognition rate
- script language
- historical documents
- language identification
- recognition algorithm
- source code
- computer vision
- arabic documents
- web services
- scanned images
- handwritten text
- scanned documents
- error correction
- chinese characters
- object recognition
- post processing
- partial occlusion
- historical manuscripts
- license plate
- gray scale images