I'm Afraid I Can't Do That: Predicting Prompt Refusal in Black-Box Generative Language Models.

Max Reuter, William Schulze
Published in: CoRR (2023)
Keyphrases