Keyphrases
- digital libraries
- document images
- document image analysis
- document processing
- image database
- document layout
- information access
- metadata
- natural images
- document analysis
- multimedia
- language identification
- scanned documents
- page segmentation
- document image understanding
- scanned document images
- text lines
- printed documents
- viterbi algorithm
- historical documents
- ocr systems
- page layout
- optical character recognition