Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks.

Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao
Published in: CoRR (2023)
Keyphrases