LLMs Can Patch Up Missing Relevance Judgments in Evaluation.
Shivani Upadhyay, Ehsan Kamalloo, Jimmy Lin
Published in: CoRR (2024)
Keyphrases
- relevance judgments
- test collection
- retrieval systems
- relevance feedback
- retrieval effectiveness
- web search engines
- average precision
- relevance assessments
- learning to rank
- user feedback
- evaluation metrics
- information retrieval evaluation
- ranking functions
- evaluation measures
- relevant documents
- query performance prediction
- implicit feedback
- clickthrough data
- information retrieval systems
- information retrieval
- retrieval functions
- human judgments
- user interaction
- digital libraries
- feature selection
- data sets
- human relevance judgments
- queries and relevance judgments