Login / Signup

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models.

Patrick ChaoEdoardo DebenedettiAlexander RobeyMaksym AndriushchenkoFrancesco CroceVikash SehwagEdgar DobribanNicolas FlammarionGeorge J. PappasFlorian TramèrHamed HassaniEric Wong
Published in: CoRR (2024)
Keyphrases