An Approach for Stemming in Symbolically Compressed Indian Language Imaged Documents.
Utpal Garain, Alok Kumar Datta
Published in: ICDAR (2005)
Keyphrases
- Indian languages
- document images
- cross lingual
- information retrieval
- language identification
- information retrieval systems
- document collections
- document retrieval
- word segmentation
- text analysis
- document clustering
- user queries
- metadata
- machine translation
- Hough transform
- information access
- web documents
- digital libraries
- keywords
- document image analysis