Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads.

Avelina Asada Hadji-Kyriacou, Ognjen Arandjelovic
Published in: CoRR (2024)
Keyphrases