Extraction of text areas in printed document images.
Jean Duong, Myriam Côté, Hubert Emptoz, Ching Y. Suen
Published in: ACM Symposium on Document Engineering (2001)
Keyphrases
- document images
- printed text
- scanned documents
- optical character recognition
- text extraction
- scanned images
- line extraction
- document analysis
- printed documents
- OCR systems
- document processing
- scanned document images
- text regions
- document image analysis
- page layout
- text lines
- language identification
- word level
- page segmentation
- historical documents
- information extraction
- document image retrieval
- Indian languages
- text detection
- mathematical formulas
- document layout
- image binarization
- text processing
- handwritten documents
- information retrieval
- keywords
- text retrieval
- Hough transform