Studious Bob Fight Back Against Jailbreaking via Prompt Adversarial Tuning.

Yichuan Mo, Yuji Wang, Zeming Wei, Yisen Wang
Published in: CoRR (2024)
Keyphrases
  • multi agent
  • rule selection
  • metadata
  • private information
  • human beings
  • fine tune
  • real world
  • information retrieval
  • reinforcement learning
  • objective function
  • parameter settings