Login / Signup
Studious Bob Fight Back Against Jailbreaking via Prompt Adversarial Tuning.
Yichuan Mo
Yuji Wang
Zeming Wei
Yisen Wang
Published in:
CoRR (2024)
Keyphrases
</>
multi agent
rule selection
metadata
private information
human beings
fine tune
real world
information retrieval
reinforcement learning
objective function
parameter settings