Balancing Utility and Exposure Fairness for Integrated Ranking with Reinforcement Learning.
Wei XiaWeiwen LiuYifan LiuRuiming TangPublished in: CIKM (2022)
Keyphrases
- reinforcement learning
- ranking algorithm
- web search
- markov decision processes
- machine learning
- model free
- reinforcement learning algorithms
- function approximation
- ranking functions
- resource allocation
- optimal policy
- dynamic programming
- learning to rank
- learning process
- multi agent
- temporal difference
- temporal difference learning
- rank order
- expected utility
- robotic control
- link analysis
- optimal control
- ranked list
- utility function
- decision making