S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models.
Fangyu LeiQian LiuYiming HuangShizhu HeJun ZhaoKang LiuPublished in: CoRR (2023)
Keyphrases
- language model
- systematic evaluation
- language modeling
- comprehensive evaluation
- query expansion
- document retrieval
- biomedical text
- automatic query expansion
- probabilistic model
- n gram
- pseudo relevance feedback
- language modelling
- experimental evaluation
- information retrieval
- statistical language models
- retrieval model
- test collection
- context sensitive
- smoothing methods
- query terms
- cross lingual
- relevance model
- document ranking
- text classification
- language models for information retrieval
- image retrieval