Sign in

OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models.

Hainiu XuRuncong ZhaoLixing ZhuJinhua DuYulan He
Published in: CoRR (2024)
Keyphrases