Sign in

Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model.

Alexander BuchholzBen LondonGiuseppe Di BenedettoThorsten Joachims
Published in: CoRR (2022)
Keyphrases
  • probabilistic model
  • similarity measure
  • training data
  • dynamic programming
  • learning to rank