Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs.
Víctor GallegoPublished in: CoRR (2024)
Keyphrases
- synthetic data
- data sets
- optimization problems
- real world
- real image data
- global optimization
- synthetic datasets
- image alignment
- optimization algorithm
- user preferences
- optimization process
- discrete optimization
- pairwise
- similarity measure
- optimization method
- constrained optimization
- mri data
- optimization strategies
- information retrieval