Sign in

Alignment is not sufficient to prevent large language models from generating harmful information: A psychoanalytic perspective.

Zi YinWei DingJia Liu
Published in: CoRR (2023)
Keyphrases