Sign in

Routine Bandits: Minimizing Regret on Recurring Problems.

Hassan SaberLéo SaciOdalric-Ambrym MaillardAudrey Durand
Published in: ECML/PKDD (1) (2021)
Keyphrases
  • database
  • machine learning
  • online learning
  • artificial intelligence
  • reinforcement learning
  • lower bound
  • np complete