Memory-Constrained No-Regret Learning in Adversarial Multi-Armed Bandits.

Xiao Xu, Qing Zhao

Published in: IEEE Trans. Signal Process. (2021)

Keyphrases

multi-armed bandits
online learning
lower bound
learning tasks
bandit problems
machine learning
supervised learning
learning problems
decision making
reinforcement learning
multi-class