Login / Signup

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.

Kaiwen WangRahul KidambiRyan SullivanAlekh AgarwalChristoph DannAndrea MichiMarco GelmiYunxuan LiRaghav GuptaAvinava DubeyAlexandre RaméJohan FerretGeoffrey CideronLe HouHongkun YuAmr AhmedAranyak MehtaLéonard HussenotOlivier BachemEdouard Leurent
Published in: CoRR (2024)
Keyphrases