Online Diverse Learning to Rank from Partial-Click Feedback.
Prakhar Gupta, Gaurush Hiranandani, Harvineet Singh, Branislav Kveton, Zheng Wen, Iftikhar Ahamath Burhanuddin
Published in: CoRR (2018)
Keyphrases
- learning to rank
- balancing exploration and exploitation
- user feedback
- ranking functions
- loss function
- information retrieval
- evaluation measures
- web search
- learning to rank algorithms
- ranking svm
- behavioral targeting
- direct optimization
- document retrieval
- relevance judgments
- test collection
- search engine
- directly optimize
- precision and recall
- pairwise
- learning algorithm
- reinforcement learning
- online advertising
- query dependent
- ranking algorithm
- supervised learning
- user behavior
- web search engines
- data sets
- exploration exploitation dilemma