Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs.

Víctor Gallego
Published in: CoRR (2024)