Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication.

Yuanhao Wang Jiachen Hu Xiaoyu Chen Liwei Wang

Published in: ICLR (2020)

Keyphrases

learning process
online learning
learning algorithm
learning tasks
communication overhead
multi agent
knowledge acquisition
learning problems
distributed environment
communication cost
reinforcement learning
optimal solution
markov chain
learning systems
human computer