Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints.
Chaoqi WangYibo JiangChenghao YangHan LiuYuxin ChenPublished in: CoRR (2023)
Keyphrases
- constrained optimization
- kullback leibler
- soft constraints
- optimization process
- hard constraints
- constraint satisfaction
- global optimization
- np hard optimization problems
- optimization criteria
- decision variables
- combinatorial optimization
- optimization algorithm
- wide variety
- kl divergence
- optimization problems
- constraint programming
- optimization model
- penalty function
- optimization method
- divergence measure
- bayesian networks