Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge.

Weikai Lu, Ziqian Zeng, Jianwei Wang, Zhengdong Lu, Zelin Chen, Huiping Zhuang, Cen Chen
Published in: CoRR (2024)