Login / Signup

Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.

Qizhi LiCanzhe ZhaoTong YuJunda WuShuai Li
Published in: User Model. User Adapt. Interact. (2023)
Keyphrases
  • user preferences
  • learning process
  • supervised learning
  • online learning
  • unsupervised learning
  • active learning
  • learning algorithm
  • reinforcement learning
  • clustering algorithm
  • probability distribution