Login / Signup
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
Seungju Han
Kavel Rao
Allyson Ettinger
Liwei Jiang
Bill Yuchen Lin
Nathan Lambert
Yejin Choi
Nouha Dziri
Published in:
CoRR (2024)
Keyphrases
</>
decision support
genetic algorithm
decision making
open systems
user friendly
risk management
software tools
computational tools
application programming interfaces