Understanding Reference Policies in Direct Preference Optimization.

Yixin Liu, Pengfei Liu, Arman Cohan
Published in: CoRR (2024)
Keyphrases
  • optimization problems
  • constrained optimization
  • optimization algorithm
  • global optimization
  • databases
  • data mining
  • optimal policy
  • optimization methods
  • multi-attribute
  • optimization process
  • multi-criteria