Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization.
Niyati BafnaKenton MurrayDavid YarowskyPublished in: CoRR (2024)
Keyphrases
- language modeling
- cross lingual
- language model
- parallel corpus
- indian languages
- translation model
- linguistic resources
- pseudo feedback
- cross lingual information retrieval
- information retrieval
- n gram
- cross language
- language independent
- source language
- machine translation system
- document retrieval
- probabilistic model
- retrieval model
- query expansion
- bilingual dictionaries
- test collection
- natural language
- statistical machine translation
- pseudo relevance feedback
- cross language retrieval
- query terms
- smoothing methods
- training data
- target language
- out of vocabulary
- word segmentation
- relevance model
- query translation
- vector space model
- language modeling framework