OCR accuracy improvement on document images through a novel pre-processing approach.
Abdeslam El HarrajNaoufal RaissouniPublished in: CoRR (2015)
Keyphrases
- document images
- preprocessing
- optical character recognition
- document image analysis
- printed documents
- document processing
- document analysis
- document image retrieval
- scanned documents
- page layout
- page segmentation
- word level
- feature extraction
- historical documents
- ocr systems
- machine printed
- document image understanding
- post processing
- text lines
- color images
- printed text