Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks.
Marco AF Pimentel
Clément Christophe
Tathagata Raha
Prateek Munjal
Praveen K. Kanithi
Shadab Khan
Published in:
CoRR (2024)
Keyphrases
language model
language modeling
information retrieval
speech recognition
n-gram
document retrieval
query expansion
query terms
vector space model
machine learning
probabilistic model
mixture model
context sensitive
evaluation measures
evaluation metrics