Login / Signup

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.

Yuxiang ZhangJing ChenJunjie WangYaxin LiuCheng YangChufan ShiXinyu ZhuZihao LinHanwen WanYujiu YangTetsuya SakaiTian FengHayato Yamana
Published in: CoRR (2024)
Keyphrases