Login / Signup
Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.
Qizhi Li
Canzhe Zhao
Tong Yu
Junda Wu
Shuai Li
Published in:
User Model. User Adapt. Interact. (2023)
Keyphrases
</>
user preferences
learning process
supervised learning
online learning
unsupervised learning
active learning
learning algorithm
reinforcement learning
clustering algorithm
probability distribution