Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation.

Rusheb Shah, Quentin Feuillade-Montixi, Soroush Pour, Arush Tagade, Stephen Casper, Javier Rando
Published in: CoRR (2023)
Keyphrases