A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily.

Published in: NAACL-HLT (2024)

Keyphrases