Learning Multi-Objective Rewards and User Utility Function in Contextual Bandits for Personalized Ranking.
Nirandika Wanigasekara
Yuxuan Liang
Siong Thye Goh
Ye Liu
Joseph Jay Williams
David S. Rosenblum
Published in:
IJCAI (2019)
Keyphrases
utility function
multi-objective
reinforcement learning
learning algorithm
multi-armed bandits
personalized ranking
decision makers
contextual information
decision problems
decision theory
genetic algorithm
evolutionary algorithm
supervised learning
collaborative filtering
multi-objective optimization