Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions.

Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou
Published in: CoRR (2023)
Keyphrases