Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation.
Yunjie JiYan GongYong DengYiping PengQiang NiuBaochang MaXiangang LiPublished in: CoRR (2023)
Keyphrases
- language model
- training data
- language modeling
- n gram
- document retrieval
- probabilistic model
- query expansion
- language modelling
- speech recognition
- retrieval model
- context sensitive
- test collection
- information retrieval
- statistical language models
- learning algorithm
- smoothing methods
- training set
- multimedia
- word segmentation
- decision trees
- word error rate
- language model for information retrieval
- classification accuracy
- xml retrieval
- translation model
- term dependencies
- evaluation measures
- vector space model
- text summarization
- relevance assessments
- document ranking
- search engine