Login / Signup
Understanding Reference Policies in Direct Preference Optimization.
Yixin Liu
Pengfei Liu
Arman Cohan
Published in:
CoRR (2024)
Keyphrases
</>
optimization problems
constrained optimization
optimization algorithm
global optimization
databases
data mining
optimal policy
optimization methods
multi attribute
optimization process
multi criteria