Word-Level Thirteen Official Indic Languages Database for Script Identification in Multi-script Documents.
Sk Md ObaidullahK. C. SantoshChayan HalderNibaran DasKaushik RoyPublished in: RTIP2R (2016)
Keyphrases
- word level
- language independent
- document analysis
- chinese text retrieval
- word spotting
- indian languages
- word recognition
- information retrieval systems
- document collections
- text retrieval
- xml documents
- parallel corpora
- document images
- source language
- language identification
- information retrieval
- word segmentation
- document level
- text analysis
- sentence level
- target language
- document clustering
- machine translation
- document retrieval
- n gram
- keywords
- character recognition
- statistical machine translation
- user queries
- handwritten documents
- web documents