Word image based latent semantic indexing for conceptual querying in document image databases.
Sameek BanerjeeGaurav HaritSantanu ChaudhuryPublished in: ICDAR (2007)
Keyphrases
- image database
- latent semantic indexing
- document space
- document representation
- vector space model
- information retrieval
- text retrieval
- vector space
- document collections
- term frequency
- image retrieval
- document clustering
- singular value decomposition
- image data
- term weighting
- latent semantic space
- image content
- keywords
- web documents
- feature space
- bag of words
- retrieval systems
- text documents
- n gram
- data fusion
- retrieval model
- language model
- database
- co occurrence
- computer vision
- document retrieval
- information retrieval systems
- object recognition
- text categorization
- feature vectors
- tf idf
- visual content
- relevant documents
- semantic information