Login / Signup
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards.
Mengfan Xu
Diego Klabjan
Published in:
NeurIPS (2023)
Keyphrases
</>
multi agent
randomly distributed
decentralized decision making
multi armed bandit
reinforcement learning
multi armed bandits
bandit problems
cooperative
multi agent systems
markov decision processes
state space
machine learning
feature selection
upper bound
nearest neighbor