RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models models via Romanization.
Jaavid Aktar HusainRaj DabreAswanth KumarRatish PuduppullyAnoop KunchukuttanPublished in: CoRR (2024)
Keyphrases
- language model
- probabilistic model
- language modelling
- statistical language models
- language modeling
- smoothing methods
- query expansion
- n gram
- speech recognition
- retrieval model
- translation model
- relevance model
- document retrieval
- information extraction
- cross lingual
- test collection
- statistical language modeling
- digital libraries
- statistical models
- ir models
- statistical model
- probabilistic retrieval models