Unbiased Cascade Bandits: Mitigating Exposure Bias in Online Learning to Rank Recommendation.
Masoud Mansoury, Himan Abdollahpouri, Bamshad Mobasher, Mykola Pechenizkiy, Robin Burke, Milad Sabouri
Published in: CoRR (2021)
Keyphrases
- learning to rank
- collaborative filtering
- balancing exploration and exploitation
- ranking functions
- information retrieval
- user feedback
- loss function
- evaluation measures
- ranking svm
- document retrieval
- learning to rank algorithms
- test collection
- recommender systems
- supervised learning
- evaluation metrics
- direct optimization
- retrieval systems
- reinforcement learning
- data sets
- behavioral targeting
- keywords
- exploration exploitation dilemma
- directly optimize
- low variance
- query dependent
- face recognition
- training data
- pairwise