Login / Signup

Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models.

Xiang JiSanjeev KulkarniMengdi WangTengyang Xie
Published in: CoRR (2024)
Keyphrases