Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget.
Jasmin Brandt
Viktor Bengs
Björn Haddenhorst
Eyke Hüllermeier
Published in:
NeurIPS (2022)
Keyphrases
finding optimal
multi armed bandits
multi armed bandit
multi armed bandit problems
regret bounds
bandit problems
stochastic systems
reinforcement learning
discrete random variables
optimal or near optimal
upper bound
lower bound
learning experience
linear regression
online learning
monte carlo
stochastic processes