Complete Policy Regret Bounds for Tallying Bandits.

Dhruv Malik, Yuanzhi Li, Aarti Singh
Published in: CoRR (2022)
Keyphrases
  • regret bounds
  • multi armed bandit
  • lower bound
  • online learning
  • linear regression
  • upper bound
  • feature selection
  • bayesian networks
  • information theoretic
  • online convex optimization