Login / Signup

Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments.

Roland DaynauthJason Mars
Published in: CoRR (2024)
Keyphrases
  • language model
  • probabilistic model
  • information retrieval
  • translation model
  • parameter estimation
  • n gram
  • text classification
  • test collection
  • document retrieval
  • relevance model