Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences.
Tanner FiezShreyas SekarLiyuan ZhengLillian J. RatliffPublished in: CoRR (2018)
Keyphrases
- dynamic environments
- multi agent
- decision making
- multi agent systems
- preference elicitation
- multiagent systems
- highly dynamic
- cooperative
- dynamically evolving
- intelligent agents
- autonomous agents
- multiple agents
- changing environment
- multi agent decision making
- partial information
- user preferences
- open environments
- mobile robot
- learning algorithm
- multi attribute
- agent model
- incomplete information
- virtual organization
- software agents
- reasoning process
- single agent
- partial knowledge
- stochastic systems
- contract net protocol
- distributed agents