Sign in

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.

Yuntao BaiAndy JonesKamal NdousseAmanda AskellAnna ChenNova DasSarmaDawn DrainStanislav FortDeep GanguliTom HenighanNicholas JosephSaurav KadavathJackson KernionTom ConerlySheer El ShowkNelson ElhageZac Hatfield-DoddsDanny HernandezTristan HumeScott JohnstonShauna KravecLiane LovittNeel NandaCatherine OlssonDario AmodeiTom B. BrownJack ClarkSam McCandlishChris OlahBenjamin MannJared Kaplan
Published in: CoRR (2022)
Keyphrases