Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages.
Tejas Indulal Dhamecha, V. Rudra Murthy, Samarth Bharadwaj, Karthik Sankaranarayanan, Pushpak Bhattacharyya
Published in: CoRR (2021)
Keyphrases
- language model
- fine tuning
- language specific
- n gram
- language modeling
- out of vocabulary
- cross lingual
- comparable corpora
- language independent
- query terms
- cross language information retrieval
- language resources
- multilingual documents
- parallel corpus
- linguistic resources
- machine translation system
- probabilistic model
- translation model
- document retrieval
- Indian languages
- information retrieval
- statistical machine translation
- language modelling
- test collection
- natural language
- retrieval model
- target language
- speech recognition
- language models for information retrieval
- source language
- statistical language models
- cross lingual information retrieval
- query expansion
- fine tuned
- context sensitive
- smoothing methods
- Chinese English
- machine translation
- co occurrence
- cross language retrieval
- relevance model
- cross language
- pseudo relevance feedback
- vector space model
- bilingual dictionaries
- text classification
- information retrieval systems
- term dependencies
- parallel corpora
- document ranking