Word retrieval in document images without OCR.
Simone Marinai, Emanuele Marino, Giovanni Soda
Published in: SEBD (2003)
Keyphrases
- document images
- document image retrieval
- document analysis
- word level
- printed documents
- handwritten documents
- page layout
- optical character recognition
- indian languages
- word spotting
- page segmentation
- scanned documents
- document image analysis
- document processing
- word recognition
- machine printed text
- printed text
- language identification
- text lines
- document retrieval
- ocr systems
- document image understanding
- word segmentation
- comparative evaluation
- retrieval model
- machine printed
- query expansion
- line extraction
- test collection
- n gram
- relevance feedback
- historical documents
- character recognition
- text retrieval
- scanned document images