Estimating Large Language Model Capabilities without Labeled Test Data.
Harvey Yiyun FuQinyuan YeAlbert XuXiang RenRobin JiaPublished in: CoRR (2023)
Keyphrases
- test data
- language model
- training data
- language modeling
- training and test data
- training set
- n gram
- document retrieval
- speech recognition
- test set
- query expansion
- information retrieval
- test cases
- probabilistic model
- language modelling
- retrieval model
- ad hoc information retrieval
- data sets
- context sensitive
- mixture model
- supervised learning
- query terms
- test collection
- translation model
- training samples
- learning algorithm
- word clouds
- smoothing methods
- active learning
- base classifiers
- decision trees
- text classification
- error rate
- naive bayes