OR-Bench: An Over-Refusal Benchmark for Large Language Models.

Justin Cui, Wei-Lin Chiang, Ion Stoica, Cho-Jui Hsieh
Published in: CoRR (2024)
Keyphrases