Reinforcement Online Learning to Rank with Unbiased Reward Shaping.
Shengyao ZhuangZhihao QiaoGuido ZucconPublished in: CoRR (2022)
Keyphrases
- learning to rank
- reward shaping
- balancing exploration and exploitation
- reinforcement learning
- ranking functions
- loss function
- information retrieval
- document retrieval
- direct optimization
- reinforcement learning algorithms
- complex domains
- ranking svm
- evaluation measures
- state space
- function approximation
- learning to rank algorithms
- collaborative filtering
- ranking algorithm
- probabilistic model
- retrieval systems
- test collection
- e learning