SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages.
Wenxuan ZhangHou Pong ChanYiran ZhaoMahani AljuniedJianyu WangChaoqun LiuYue DengZhiqiang HuWeiwen XuYew Ken ChiaXin LiLidong BingPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- cross lingual
- language independent
- n gram
- comparable corpora
- multilingual information retrieval
- cross lingual information retrieval
- probabilistic model
- statistical machine translation
- document retrieval
- retrieval model
- language modelling
- cross language
- query specific
- query expansion
- translation model
- vector space model
- information retrieval
- test collection
- statistical language models
- speech recognition
- language models for information retrieval
- document ranking
- relevance model
- digital libraries
- context sensitive
- parallel corpora
- query terms
- linguistic resources
- ad hoc information retrieval
- machine translation
- text retrieval
- document level
- cross language information retrieval
- pseudo relevance feedback
- chinese english
- search engine
- word segmentation
- statistical language modeling
- machine learning