Robust Recognition of Documents by Fusing Results of Word Clusters.
Venkat Rasagna, Anand Kumar, C. V. Jawahar, Raghavan Manmatha
Published in: ICDAR (2009)
Keyphrases
- robust recognition
- document clustering
- related documents
- word spotting
- word frequencies
- similar documents
- keywords
- text corpus
- text documents
- partial occlusion
- information retrieval
- document collections
- co-occurrence
- information retrieval systems
- term frequency
- latent topics
- stop words
- relevant documents
- multiword
- vector space model
- word pairs
- clustering algorithm
- natural language text
- query terms
- data points
- document analysis
- text categorization
- document images
- moving objects
- pairwise