MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
Shengding HuYuge TuXu HanChaoqun HeGanqu CuiXiang LongZhi ZhengYewei FangYuxiang HuangWeilin ZhaoXinrong ZhangZhen Leng ThaiKai ZhangChongyi WangYuan YaoChenyang ZhaoJie ZhouJie CaiZhongwu ZhaiNing DingChao JiaGuoyang ZengDahai LiZhiyuan LiuMaosong SunPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- probabilistic model
- n gram
- information retrieval
- speech recognition
- query expansion
- document retrieval
- retrieval model
- language modelling
- test collection
- statistical language models
- smoothing methods
- context sensitive
- query terms
- translation model
- ad hoc information retrieval
- pseudo relevance feedback
- term dependencies
- word error rate
- document length
- query processing
- language model for information retrieval
- query specific
- document ranking
- passage retrieval
- vector space model
- machine learning
- search engine