EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation.
Atnafu Lambebo Tonja, Israel Abebe Azime, Tadesse Destaw Belay, Mesay Gemeda Yigezu, Moges Ahmed Mehamed, Abinew Ali Ayele, Ebrahim Chekol Jibril, Michael Melese Woldeyohannis, Olga Kolesnikova, Philipp Slusallek, Dietrich Klakow, Seid Muhie Yimam
Published in: LREC/COLING (2024)
Keyphrases
- language model
- language modeling
- cross lingual
- n gram
- language independent
- retrieval model
- document retrieval
- probabilistic model
- information retrieval
- query expansion
- comparable corpora
- language modelling
- statistical machine translation
- test collection
- statistical language models
- translation model
- context sensitive
- cross language
- document ranking
- cross lingual information retrieval
- vector space model
- pseudo relevance feedback
- smoothing methods
- speech recognition
- ad hoc information retrieval
- parallel corpora
- query specific
- out of vocabulary
- language models for information retrieval
- okapi bm
- language model for information retrieval
- relevance model
- evaluation metrics
- text classification
- digital libraries