CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model.
Peng DiJianguo LiHang YuWei JiangWenting CaiYang CaoChaoyu ChenDajun ChenHongwei ChenLiang ChenGang FanJie GongZi GongWen HuTingting GuoZhichao LeiTing LiZheng LiMing LiangCong LiaoBingchang LiuJiachen LiuZhiwei LiuShaojun LuMin ShenGuangpei WangHuan WangZhi WangZhaogui XuJiawei YangQing YeGehao ZhangYu ZhangZelin ZhaoXunjin ZhengHailian ZhouLifu ZhuXianying ZhuPublished in: ICSE-SEIP (2024)
Keyphrases
- language model
- multi lingual
- information retrieval
- language modeling
- cross lingual
- language independent
- n gram
- information access
- document retrieval
- probabilistic model
- retrieval model
- ad hoc information retrieval
- smoothing methods
- speech recognition
- mixture model
- test collection
- query expansion
- translation model
- language identification
- machine translation
- information retrieval systems
- domain specific
- cross language
- feature vectors
- feature extraction
- search engine
- machine learning