Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.

Qizhi Li, Canzhe Zhao, Tong Yu, Junda Wu, Shuai Li
Published in: User Model. User Adapt. Interact. (2023)
Keyphrases
  • user preferences
  • learning process
  • supervised learning
  • online learning
  • unsupervised learning
  • active learning
  • learning algorithm
  • reinforcement learning
  • clustering algorithm
  • probability distribution