Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions.
Ruochen ZhaoWenxuan ZhangYew Ken ChiaDeli ZhaoLidong BingPublished in: CoRR (2024)
Keyphrases
- multi agent systems
- intelligent agents
- multi agent
- multiagent systems
- agent model
- autonomous agents
- software agents
- agent systems
- decision making
- dynamic environments
- agent technology
- multiple agents
- mobile agents
- cooperating agents
- interface agent
- agent environment
- agent architecture
- pedagogical agents
- decision theoretic