Login / Signup
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints.
Samuel Daulton
Shaun Singh
Vashist Avadhanula
Drew Dimmery
Eytan Bakshy
Published in:
CoRR (2019)
Keyphrases
</>
bandit problems
exploration exploitation
multi armed bandits
contextual information
constraint programming
context sensitive
random sampling
fuzzy sets
special case
graphical models
monte carlo
context dependent
linear constraints
multi armed bandit problems