Decentralized multi-armed bandit with imperfect observations.
Keqin Liu
Qing Zhao
Bhaskar Krishnamachari
Published in:
Allerton (2010)
Keyphrases
decentralized decision making
multi-armed bandit
multi-armed bandits
multi-agent
decision making
reinforcement learning
resource allocation
bandit problems
machine learning
active learning
probability distribution
regret bounds