Login / Signup
Towards Robust QA Evaluation via Open LLMs.
Ehsan Kamalloo
Shivani Upadhyay
Jimmy Lin
Published in:
SIGIR (2024)
Keyphrases
</>
question answering
data sets
databases
artificial intelligence
evaluation criteria
digital libraries
evaluation methods
evaluation process
neural network
real world
machine learning
information retrieval
probabilistic model
partial occlusion
evaluation metrics