OCR for TIFF Compressed Document Images Directly in Compressed Domain Using Text segmentation and Hidden Markov Model.
Dikshit SharmaMohammed JavedPublished in: CoRR (2022)
Keyphrases
- document images
- compressed domain
- hidden markov models
- optical character recognition
- video analysis
- text regions
- bitstream
- motion vectors
- printed documents
- document analysis
- speech recognition
- scanned documents
- text lines
- inter frame
- viterbi algorithm
- coding scheme
- anomaly detection
- information retrieval systems
- sentence level
- motion estimation
- image data
- image segmentation
- information retrieval