Login / Signup

Memory-Constrained No-Regret Learning in Adversarial Multi-Armed Bandits.

Xiao XuQing Zhao
Published in: IEEE Trans. Signal Process. (2021)
Keyphrases
  • multi armed bandits
  • online learning
  • lower bound
  • learning tasks
  • bandit problems
  • machine learning
  • supervised learning
  • learning problems
  • decision making
  • reinforcement learning
  • multi class