Login / Signup

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections.

Yuanpu CaoBochuan CaoJinghui Chen
Published in: NAACL-HLT (2024)
Keyphrases