Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks.

Published in: CoRR (2023)

Keyphrases