C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.
Qizhi Li
Canzhe Zhao
Tong Yu
Junda Wu
Shuai Li
Published in:
User Model. User Adapt. Interact. (2023)
Keyphrases
</>
user preferences
learning process
supervised learning
online learning
unsupervised learning
active learning
learning algorithm
reinforcement learning
clustering algorithm
probability distribution