Login / Signup
On Logarithmic Regret for Bandits with Knapsacks.
Wenbo Ren
Jia Liu
Ness B. Shroff
Published in:
CISS (2021)
Keyphrases
</>
regret bounds
online learning
lower bound
linear regression
knapsack problem
expert advice
multi armed bandit
upper bound
online convex optimization
neural network
bregman divergences
linear predictors
multi armed bandits
data mining
multi armed bandit problems