Sign in

Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks.

Andy ZhouBo LiHaohan Wang
Published in: CoRR (2024)
Keyphrases