A Novel Hybrid Optical Character Recognition Approach for Digitizing Text in Forms.
Roland Graef, Mazen M. N. Morsy
Published in: DESRIST (2019)
Keyphrases
- optical character recognition
- OCR systems
- text recognition
- printed documents
- text extraction
- printed text
- document images
- character recognition
- text regions
- English text
- historical manuscripts
- scanned documents
- text lines
- handwriting recognition
- character segmentation
- page segmentation
- word spotting
- image binarization
- character n-grams
- document analysis
- word level
- historical documents
- text mining
- Viterbi algorithm
- comparative evaluation
- machine vision
- text retrieval
- information retrieval