"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models.

Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, Yang Zhang
Published in: CoRR (2023)
Keyphrases