A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily.

Peng Ding, Jun Kuang, Dan Ma, Xuezhi Cao, Yunsen Xian, Jiajun Chen, Shujian Huang
Published in: NAACL-HLT (2024)