Login / Signup

Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic.

Rishabh BhardwajDo Duc AnhSoujanya Poria
Published in: CoRR (2024)
Keyphrases