Login / Signup
Hierarchical Conversational Preference Elicitation with Bandit Feedback.
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
Published in:
CoRR (2022)
Keyphrases
</>
preference elicitation
utility function
multi criteria
minimax regret
optimal decisions
decision theory
inverse reinforcement learning
decision making
decision makers
artificial intelligence
lower bound
decision problems
multi attribute
combinatorial auctions
privacy issues