Reinforcement Online Learning to Rank with Unbiased Reward Shaping.

Shengyao Zhuang, Zhihao Qiao, Guido Zuccon

Published in: CoRR (2022)

Keyphrases

learning to rank
reward shaping
balancing exploration and exploitation
reinforcement learning
ranking functions
loss function
information retrieval
document retrieval
direct optimization
reinforcement learning algorithms
complex domains
ranking svm
evaluation measures
state space
function approximation
learning to rank algorithms
collaborative filtering
ranking algorithm
probabilistic model
retrieval systems
test collection
e-learning