Content-level Annotation of Large Collection of Printed Document Images.
Anand Kumar, C. V. Jawahar
Published in: ICDAR (2007)
Keyphrases
- document images
- optical character recognition
- scanned documents
- printed text
- word level
- document image analysis
- metadata
- multimedia collections
- scanned images
- effective retrieval
- page layout
- document analysis
- document image understanding
- multimedia
- document processing
- printed documents
- digital libraries
- historical documents
- scanned document images
- mathematical formulas
- language identification
- document collections
- page segmentation
- document clustering
- image analysis
- document image retrieval
- image retrieval
- image processing