Login / Signup

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle.

Emman HaiderDaniel Perez-BeckerThomas PortetPiyush MadanAmit GargDavid MajercakWen WenDongwoo KimZiyi YangJianwen ZhangHiteshi SharmaBlake BullwinkelMartin PouliotAmanda MinnichShiven ChawlaSolianna HerreraShahed WarrethMaggie EnglerGary LopezNina ChikanovRaja Sekhar Rao DheekondaBolor-Erdene JagdagdorjRoman LutzRichard LundeenTori WesterhoffPete BryanChristian SeifertRam Shankar Siva KumarAndrew BerkleyAlex Kessler
Published in: CoRR (2024)
Keyphrases