Adversarial Contrastive Decoding: Boosting Safety Alignment of Large Language Models via Opposite Prompt Optimization.

Zhengyue Zhao, Xiaoyun Zhang, Kaidi Xu, Xing Hu, Rui Zhang, Zidong Du, Qi Guo, Yunji Chen
Published in: CoRR (2024)