Variance Reduction in Gradient Exploration for Online Learning to Rank.
Huazheng WangSonwoo KimEric McCord-SnookQingyun WuHongning WangPublished in: CoRR (2019)
Keyphrases
- learning to rank
- balancing exploration and exploitation
- variance reduction
- gradient estimation
- ranking functions
- loss function
- information retrieval
- reinforcement learning
- sample size
- monte carlo
- evaluation measures
- document retrieval
- ranking svm
- collaborative filtering
- retrieval systems
- ranking algorithm
- learning to rank algorithms
- multi class