AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models.

Dong Shu, Mingyu Jin, Suiyuan Zhu, Beichen Wang, Zihao Zhou, Chong Zhang, Yongfeng Zhang
Published in: CoRR (2024)
Keyphrases