Multi-agent Heterogeneous Stochastic Linear Bandits.
Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran
Published in: ECML/PKDD (4) (2022)
Keyphrases
- multi agent
- stochastic systems
- regret bounds
- cooperative
- reinforcement learning
- multi agent systems
- stochastic models
- multiagent systems
- autonomous agents
- learning automata
- coalition formation
- stochastic optimization
- stochastic processes
- intelligent agents
- linear systems
- stochastic model
- single agent
- database
- linear model
- simple linear
- monte carlo
- dynamic programming
- heterogeneous agents
- real time