On Logarithmic Regret for Bandits with Knapsacks.

Wenbo Ren Jia Liu Ness B. Shroff

Published in: CISS (2021)

Keyphrases

regret bounds
online learning
lower bound
linear regression
knapsack problem
expert advice
multi armed bandit
upper bound
online convex optimization
neural network
bregman divergences
linear predictors
multi armed bandits
data mining
multi armed bandit problems