Tight Regret Bounds for Infinite-armed Linear Contextual Bandits.

Yingkai Li Yining Wang Yuan Zhou

Published in: CoRR (2019)

Keyphrases

regret bounds
lower bound
upper bound
online learning
linear regression
multi armed bandit
worst case
objective function
optimal solution
bregman divergences
probabilistic model
information theoretic