Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication.
Yuanhao Wang
Jiachen Hu
Xiaoyu Chen
Liwei Wang
Published in:
ICLR (2020)
Keyphrases
learning process
online learning
learning algorithm
learning tasks
communication overhead
multi agent
knowledge acquisition
learning problems
distributed environment
communication cost
reinforcement learning
optimal solution
markov chain
learning systems
human computer