Login / Signup
Thompson Sampling for Linearly Constrained Bandits.
Vidit Saxena
Joseph E. Gonzalez
Joakim Jaldén
Published in:
CoRR (2020)
Keyphrases
</>
linearly constrained
linear constraints
variational inequalities
multi armed bandit
random sampling
monte carlo
reinforcement learning
cooperative
stochastic systems
optimal solution