Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean.
ChangSu Choi, Yongbin Jeong, Seoyoon Park, Inho Won, HyeonSeok Lim, Sangmin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, Hyejin Lee, Younggyun Hahm, Hansaem Kim, Kyungtae Lim
Published in: LREC/COLING (2024)
Keyphrases
- language model
- language modeling
- machine translation system
- comparable corpora
- cross lingual
- translation model
- document retrieval
- n gram
- retrieval model
- probabilistic model
- language modelling
- statistical machine translation
- speech recognition
- information retrieval
- parallel corpus
- test collection
- query expansion
- statistical language models
- smoothing methods
- context sensitive
- language independent
- vector space model
- ad hoc information retrieval
- cross language
- parallel corpora
- bilingual dictionaries
- linguistic resources
- cross language retrieval
- language models for information retrieval
- language model for information retrieval
- natural language
- query terms
- query specific
- document ranking
- relevance model
- document length
- query translation
- retrieval effectiveness
- digital libraries