Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios.
Shijue HuangWanjun ZhongJianqiao LuQi ZhuJiahui GaoWeiwen LiuYutai HouXingshan ZengYasheng WangLifeng ShangXin JiangRuifeng XuQun LiuPublished in: ACL (Findings) (2024)