
Aligning Crowd Feedback via Distributional Preference Reward Modeling.

Dexun Li, Cong Zhang, Kuicai Dong, Derrick-Goh-Xin Deik, Ruiming Tang, Yong Liu
Published in: CoRR (2024)
Keyphrases
  • co-occurrence
  • neural network
  • reinforcement learning
  • expert systems
  • relevance feedback
  • user feedback
  • image registration
  • crowd simulation