CPM: A large-scale generative Chinese Pre-trained language model.
Zhengyan ZhangXu HanHao ZhouPei KeYuxian GuDeming YeYujia QinYusheng SuHaozhe JiJian GuanFanchao QiXiaozhi WangYanan ZhengGuoyang ZengHuanqi CaoShengqi ChenDaixuan LiZhenbo SunZhiyuan LiuMinlie HuangWentao HanJie TangJuanzi LiXiaoyan ZhuMaosong SunPublished in: AI Open (2021)
Keyphrases
- language model
- pre trained
- language modeling
- n gram
- document retrieval
- probabilistic model
- retrieval model
- speech recognition
- context sensitive
- training data
- training examples
- generative model
- information retrieval
- word segmentation
- query expansion
- mixture model
- test collection
- smoothing methods
- ad hoc information retrieval
- control signals
- relevance model
- translation model
- query terms
- statistical machine translation
- cross lingual
- neural network
- language modeling framework
- data sets