Sign in

Prompt-Driven LLM Safeguarding via Directed Representation Optimization.

Chujie ZhengFan YinHao ZhouFandong MengJie ZhouKai-Wei ChangMinlie HuangNanyun Peng
Published in: CoRR (2024)
Keyphrases