RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs.

Xuan Chen, Yuzhou Nie, Lu Yan, Yunshu Mao, Wenbo Guo, Xiangyu Zhang
Published in: CoRR (2024)
Keyphrases