Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages.
Tejas I. Dhamecha, V. Rudra Murthy, Samarth Bharadwaj, Karthik Sankaranarayanan, Pushpak Bhattacharyya
Published in: EMNLP (1) (2021)
Keyphrases
- language model
- fine tuning
- language specific
- n gram
- language modeling
- cross lingual
- out of vocabulary
- comparable corpora
- language independent
- query terms
- parallel corpus
- multilingual documents
- cross language information retrieval
- document retrieval
- linguistic resources
- probabilistic model
- language resources
- speech recognition
- information retrieval
- language modelling
- indian languages
- fine tuned
- machine translation system
- statistical machine translation
- retrieval model
- test collection
- query expansion
- translation model
- context sensitive
- cross lingual information retrieval
- source language
- cross language
- machine translation
- bilingual dictionaries
- co occurrence
- smoothing methods
- target language
- pseudo relevance feedback
- query translation
- vector space model
- language models for information retrieval
- statistical language models
- parallel corpora
- document ranking
- natural language
- bayesian networks
- word segmentation
- relevance model
- transfer learning
- text classification
- information retrieval systems