UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models.

Published in: CoRR (2024)

Keyphrases