Login / Signup
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards.
Mengfan Xu
Diego Klabjan
Published in:
CoRR (2023)
Keyphrases
</>
multi agent
randomly distributed
decentralized decision making
multi armed bandit
reinforcement learning
multi armed bandits
bandit problems
cooperative
multi agent systems
state space
markov decision processes
multiple agents
decision problems
learning algorithm