MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains.
Guoli YinHaoping BaiShuang MaFeng NanYanchao SunZhaoyang XuShen MaJiarui LuXiang KongAonan ZhangDian Ang YapYizhe zhangKarsten AhnertVik KamathMathias BerglundDominic WalshTobias GindeleJuergen WiestZhengfeng LaiXiaoming WangJiulong ShanMeng CaoRuoming PangZirui WangPublished in: CoRR (2024)
Keyphrases
- real world
- multi agent
- multiagent systems
- multi agent systems
- agent architecture
- intelligent agents
- learning capabilities
- social networks
- cross domain
- application domains
- autonomous agents
- decision making
- decision theoretic
- real world application domains
- agent systems
- multiple agents
- agent model
- comparative analysis
- cooperative
- dynamic environments
- mobile agents
- neural network
- genetic algorithm
- complex domains
- state space
- agent oriented
- artificial agents
- conversational agent