A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability.
Ting Fang TanKabilan ElangovanJasmine Chiat Ling OngNigam ShahJoseph Jao-Yiu SungTien Yin WongLan XueNan LiuHaibo WangChang Fu KuoSimon ChestermanZee Kin YeongDaniel S. W. TingPublished in: CoRR (2024)
Keyphrases
- language model
- evaluation framework
- language modeling
- evaluation process
- evaluation methodology
- information retrieval
- test collection
- document retrieval
- n gram
- probabilistic model
- retrieval model
- evaluation metrics
- query expansion
- language models for information retrieval
- semantic annotation
- query terms
- evaluation measures
- vector space model
- test set
- relevance model
- machine learning
- wordnet
- evaluation methods
- evaluation model
- natural language processing
- decision trees
- multimedia