LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles.

Shulin Huang, Shirong Ma, Yinghui Li, Mengzuo Huang, Wuhe Zou, Weidong Zhang, Hai-Tao Zheng
Published in: CoRR (2023)
Keyphrases
  • incomplete information
  • partial information
  • autonomous agents
  • missing information
  • first order logic
  • query answering
  • nash equilibria
  • repeated games
  • open world