Foundational Challenges in Assuring Alignment and Safety of Large Language Models.

Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger
Published in: CoRR (2024)