Extraction of text layout structures on document images based on statistical characterization.
Su S. Chen, Robert M. Haralick, Ihsin T. Phillips
Published in: Document Recognition (1995)
Keyphrases
- document images
- page layout
- document layout
- document analysis
- line extraction
- document image retrieval
- printed documents
- structure extraction
- document image analysis
- document processing
- OCR systems
- scanned document images
- scanned documents
- text extraction
- text regions
- printed text
- word level
- text lines
- language identification
- optical character recognition
- mathematical formulas
- scanned images
- historical documents
- handwritten documents
- document image understanding
- information extraction
- page segmentation
- text documents
- document structure
- complex background
- image binarization
- natural language processing
- search engine