S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model.
Fangyu LeiQian LiuYiming HuangShizhu HeJun ZhaoKang LiuPublished in: NAACL-HLT (2024)
Keyphrases
- language model
- systematic evaluation
- language modeling
- comprehensive evaluation
- experimental evaluation
- query expansion
- biomedical text
- n gram
- probabilistic model
- document retrieval
- automatic query expansion
- pseudo relevance feedback
- retrieval model
- information retrieval
- ad hoc information retrieval
- context sensitive
- mixture model
- test collection
- smoothing methods
- vector space model
- translation model
- document representation