Imaged Document Text Retrieval Without OCR.
Chew Lim Tan, Weihua Huang, Zhaohui Yu, Yi Xu
Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2002)
Keyphrases
- text retrieval
- document images
- retrieval systems
- document collections
- keyword extraction
- document retrieval
- information retrieval
- retrieval quality
- document processing
- printed documents
- information retrieval systems
- optical character recognition
- retrieval model
- language independent
- scanned documents
- retrieval strategies
- image retrieval
- inverted file
- latent semantic indexing
- term weighting
- query expansion
- text representation
- cross language
- relevant documents
- document analysis
- document image analysis
- handwritten documents
- character recognition
- automatic query expansion
- retrieved documents
- test collection
- user queries
- text documents
- document clustering
- tf-idf
- document representation
- document structure
- retrieval process
- web search engines
- natural language processing
- relevance feedback
- knn
- digital libraries
- keywords
- bayesian networks
- multimedia