Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards.

Mengfan Xu Diego Klabjan

Published in: CoRR (2023)

Keyphrases

multi agent
randomly distributed
decentralized decision making
multi armed bandit
reinforcement learning
multi armed bandits
bandit problems
cooperative
multi agent systems
state space
markov decision processes
multiple agents
decision problems
learning algorithm