A LLM Benchmark based on the Minecraft Builder Dialog Agent Task.
Chris MadgeMassimo PoesioPublished in: CoRR (2024)
Keyphrases
- conversational agents
- conversational agent
- multi agent
- multi agent systems
- multiagent systems
- intelligent agents
- agent systems
- autonomous agents
- software agents
- mobile agents
- natural language
- decision making
- dynamic environments
- multiple agents
- agent architecture
- agent model
- comparative analysis
- cooperative
- agent oriented
- real world
- development environment
- interface agent
- agent technology
- user interface
- case study
- data sets
- human users
- decision theoretic
- reasoning process
- information systems
- plan execution
- mixed initiative
- artificial intelligence
- cooperating agents
- database