Why do universal adversarial attacks work on large language models?: Geometry might be the answer.

Varshini Subhash, Anna Bialas, Weiwei Pan, Finale Doshi-Velez
Published in: CoRR (2023)