• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles.

Shulin HuangShirong MaYinghui LiMengzuo HuangWuhe ZouWeidong ZhangHai-Tao Zheng
Published in: CoRR (2023)
Keyphrases
  • incomplete information
  • partial information
  • autonomous agents
  • missing information
  • first order logic
  • query answering
  • nash equilibria
  • repeated games
  • open world