Login / Signup
Tight last-iterate convergence rates for no-regret learning in multi-player games.
Noah Golowich
Sarath Pattathil
Constantinos Daskalakis
Published in:
NeurIPS (2020)
Keyphrases
</>
convergence rate
online learning
learning algorithm
reinforcement learning
lower bound
upper bound
learning tasks
objective function
supervised learning
learning problems
bandit problems