Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.

Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang
Published in: CoRR (2024)
Keyphrases