Login / Signup

Aligning Crowd Feedback via Distributional Preference Reward Modeling.

Dexun LiCong ZhangKuicai DongDerrick-Goh-Xin DeikRuiming TangYong Liu
Published in: CoRR (2024)
Keyphrases
  • co occurrence
  • neural network
  • reinforcement learning
  • expert systems
  • relevance feedback
  • user feedback
  • image registration
  • crowd simulation