Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages.
Sanket ShahSatarupa GuhaSimran KhanujaSunayana SitaramPublished in: CoRR (2020)
Keyphrases
- cross lingual
- indian languages
- cross lingual information retrieval
- machine translation
- cross language
- language independent
- language modeling
- multi lingual
- text classification
- out of vocabulary
- language model
- parallel corpus
- translation model
- statistical machine translation
- word segmentation
- query translation
- news articles
- feature selection
- machine translation system