Quantifying Multilingual Performance of Large Language Models Across Languages.
Zihao LiYucheng ShiZirui LiuFan YangNinghao LiuMengnan DuPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- cross lingual
- language independent
- n gram
- comparable corpora
- multilingual information retrieval
- document retrieval
- cross lingual information retrieval
- statistical machine translation
- probabilistic model
- retrieval model
- speech recognition
- cross language
- query specific
- language modelling
- information retrieval
- translation model
- parallel corpora
- query expansion
- statistical language models
- test collection
- word segmentation
- language models for information retrieval
- pseudo relevance feedback
- ad hoc information retrieval
- indian languages
- context sensitive
- text retrieval
- machine translation system
- document ranking
- chinese english
- digital libraries
- query translation
- cross language information retrieval
- retrieval effectiveness
- query terms
- smoothing methods
- term dependencies
- bilingual dictionaries
- topic modeling
- okapi bm
- statistical language modeling
- naive bayes