EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation.
Atnafu Lambebo TonjaIsrael Abebe AzimeTadesse Destaw BelayMesay Gemeda YigezuMoges Ahmed MehamedAbinew Ali AyeleEbrahim Chekol JibrilMichael Melese WoldeyohannisOlga KolesnikovaPhilipp SlusallekDietrich KlakowShengwu XiongSeid Muhie YimamPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- cross lingual
- language independent
- n gram
- probabilistic model
- speech recognition
- information retrieval
- document retrieval
- retrieval model
- query expansion
- comparable corpora
- language modelling
- statistical language models
- test collection
- ad hoc information retrieval
- cross language
- context sensitive
- smoothing methods
- word error rate
- document ranking
- cross lingual information retrieval
- language model for information retrieval
- language models for information retrieval
- translation model
- query specific
- out of vocabulary
- statistical machine translation
- relevance model
- xml retrieval
- pseudo relevance feedback
- vector space model
- query terms
- text classification
- digital libraries