Login / Signup
Minimax Regret for Cascading Bandits.
Daniel Vial
Sujay Sanghavi
Sanjay Shakkottai
R. Srikant
Published in:
CoRR (2022)
Keyphrases
</>
minimax regret
preference elicitation
utility function
stochastic programming
decision problems
misclassification costs
multistage
cross validation
incomplete information
decision theory
reward function
optimization criterion
training and test data
bayesian networks
reinforcement learning
class distribution