DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
Xiao BiDeli ChenGuanting ChenShanhuang ChenDamai DaiChengqi DengHonghui DingKai DongQiushi DuZhe FuHuazuo GaoKaige GaoWenjun GaoRuiqi GeKang GuanDaya GuoJianzhong GuoGuangbo HaoZhewen HaoYing HeWenjie HuPanpan HuangErhang LiGuowei LiJiashi LiYao LiY. K. LiWenfeng LiangFangyun LinAlex X. LiuBo LiuWen LiuXiaodong LiuXin LiuYiyuan LiuHaoyu LuShanghao LuFuli LuoShirong MaXiaotao NieTian PeiYishi PiaoJunjie QiuHui QuTongzheng RenZehui RenChong RuanZhangli ShaZhihong ShaoJunxiao SongXuecheng SuJingxiang SunYaofeng SunMinghui TangBingxuan WangPeiyi WangShiyu WangYaohui WangYongji WangTong WuY. WuXin XieZhenda XieZiwei XieYiliang XiongHanwei XuR. X. XuYanhong XuDejian YangYuxiang YouShuiping YuXingkai YuB. ZhangHaowei ZhangLecong ZhangLiyue ZhangMingchuan ZhangMinghua ZhangWentao ZhangYichao ZhangChenggang ZhaoYao ZhaoShangyan ZhouShunfeng ZhouQihao ZhuYuheng ZouPublished in: CoRR (2024)
Keyphrases
- language model
- open source
- language modeling
- n gram
- probabilistic model
- document retrieval
- speech recognition
- language modelling
- information retrieval
- query expansion
- retrieval model
- query terms
- test collection
- vector space model
- ad hoc information retrieval
- retrieval effectiveness
- context sensitive
- document length
- translation model
- pseudo relevance feedback
- statistical language models
- smoothing methods
- language models for information retrieval
- term dependencies
- language model for information retrieval
- document ranking
- passage retrieval
- relevance model
- automatic speech recognition
- question answering
- keywords