CheckEval: Robust Evaluation Framework using Large Language Model via Checklist.
Yukyung LeeJoonghoon KimJaehee KimHyowon ChoPilsung KangPublished in: CoRR (2024)
Keyphrases
- language model
- evaluation framework
- language modeling
- probabilistic model
- n gram
- document retrieval
- evaluation methodology
- test collection
- evaluation process
- information retrieval
- query expansion
- retrieval model
- semantic annotation
- evaluation metrics
- ad hoc information retrieval
- evaluation measures
- mixture model
- software engineering
- smoothing methods
- training data
- vector space model
- recommender systems