Mapping Hindi-English Text Re-use Document Pairs.
Parth GuptaKhushboo SinghalPublished in: FIRE (2011)
Keyphrases
- english text
- language identification
- document images
- word level
- document analysis
- optical character recognition
- machine translation
- natural language generation
- information retrieval
- keywords
- text lines
- speaker identification
- document retrieval
- document collections
- retrieval systems
- character recognition
- language independent
- feature selection
- binary images
- source language
- image analysis
- information retrieval systems
- language model
- document clustering
- text documents