Sign in

Direct Language Model Alignment from Online AI Feedback.

Shangmin GuoBiao ZhangTianlin LiuTianqi LiuMisha KhalmanFelipe LlinaresAlexandre RaméThomas MesnardYao ZhaoBilal PiotJohan FerretMathieu Blondel
Published in: CoRR (2024)
Keyphrases