Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward.
Xiong Wang, Riheng Jia
Published in: CoRR (2021)
Keyphrases
- multi armed bandit
- nash equilibrium
- game theory
- nash equilibria
- reinforcement learning
- multi armed bandits
- games with incomplete information
- repeated games
- decentralized decision making
- mixed strategy
- markov random field
- stochastic games
- resource allocation
- bayesian inference
- perfect information
- regret bounds
- multi agent
- machine learning
- closed form
- probabilistic model
- decision making
- learning algorithm