Login / Signup

Towards Comprehensive and Efficient Post Safety Alignment of Large Language Models via Safety Patching.

Weixiang ZhaoYulin HuZhuojun LiYang DengYanyan ZhaoBing QinTat-Seng Chua
Published in: CoRR (2024)
Keyphrases