Login / Signup

Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing.

Wei ZhaoZhe LiYige LiYe ZhangJun Sun
Published in: CoRR (2024)
Keyphrases