AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models.

Dong Shu, Mingyu Jin, Suiyuan Zhu, Beichen Wang, Zihao Zhou, Chong Zhang, Yongfeng Zhang
Published in: CoRR (2024)
Keyphrases