Login / Signup
Jailbroken: How Does LLM Safety Training Fail?
Alexander Wei
Nika Haghtalab
Jacob Steinhardt
Published in:
NeurIPS (2023)
Keyphrases
</>
training set
training process
training phase
supervised learning
test set
databases
learning algorithm
information systems
support vector
multi class
semi supervised
back propagation
health care
training algorithm
feed forward neural networks