τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains.
Shunyu YaoNoah ShinnPedram RazaviKarthik NarasimhanPublished in: CoRR (2024)
Keyphrases
- user interaction
- real world
- user behavior
- user interface
- user input
- user feedback
- user studies
- synthetic data
- multiple agents
- decision making
- wide range
- dynamic environments
- data sets
- multi agent
- multi agent systems
- autonomous agents
- software agents
- interactive segmentation
- user experience
- shape prior
- application domains
- agent model
- intelligent agents
- artificial intelligence
- agent systems
- case study
- interactive image segmentation
- real world environments
- search engine
- cross domain
- decision theoretic
- multiagent systems
- action selection