Login / Signup

Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks.

Yue ZhouHenry Peng ZouBarbara Di EugenioYang Zhang
Published in: CoRR (2024)
Keyphrases