Login / Signup
An Information-Theoretic Analysis of Thompson Sampling for Large Action Spaces.
Shi Dong
Benjamin Van Roy
Published in:
CoRR (2018)
Keyphrases
</>
action space
state space
markov decision processes
real valued
continuous state
reinforcement learning
state and action spaces
skill learning
stochastic processes
continuous action
control policies
action selection
bayesian networks
markov random field
markov chain