Tight last-iterate convergence rates for no-regret learning in multi-player games.

Noah Golowich Sarath Pattathil Constantinos Daskalakis

Published in: NeurIPS (2020)

Keyphrases

convergence rate
online learning
learning algorithm
reinforcement learning
lower bound
upper bound
learning tasks
objective function
supervised learning
learning problems
bandit problems