Towards Understanding the Influence of Reward Margin on Preference Model Performance.

Bowen Qin, Duanyu Feng, Xi Yang
Published in: CoRR (2024)
Keyphrases
  • preference model
  • decision makers
  • user preferences
  • reinforcement learning
  • decision trees
  • decision problems