Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model.
Alexander Buchholz
Ben London
Giuseppe Di Benedetto
Thorsten Joachims
Published in:
CoRR (2022)
Keyphrases
probabilistic model
similarity measure
training data
dynamic programming
learning to rank