CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model.
Peng Di, Jianguo Li, Hang Yu, Wei Jiang, Wenting Cai, Yang Cao, Chaoyu Chen, Dajun Chen, Hongwei Chen, Liang Chen, Gang Fan, Jie Gong, Zi Gong, Wen Hu, Tingting Guo, Zhichao Lei, Ting Li, Zheng Li, Ming Liang, Cong Liao, Bingchang Liu, Jiachen Liu, Zhiwei Liu, Shaojun Lu, Min Shen, Guangpei Wang, Huan Wang, Zhi Wang, Zhaogui Xu, Jiawei Yang, Qing Ye, Gehao Zhang, Yu Zhang, Zelin Zhao, Xunjin Zheng, Hailian Zhou, Lifu Zhu, Xianying Zhu
Published in: CoRR (2023)
Keyphrases
- language model
- multi-lingual
- language modeling
- cross-lingual
- information retrieval
- language-independent
- n-gram
- information access
- probabilistic model
- document retrieval
- speech recognition
- retrieval model
- test collection
- query expansion
- ad hoc information retrieval
- translation model
- smoothing methods
- mixture model
- information retrieval systems
- cross-language
- language identification
- Dirichlet prior
- information seeking
- maximum likelihood
- knowledge discovery
- digital libraries
- domain-dependent
- Bayesian networks
- search engine