An Information-Theoretic Analysis of Thompson Sampling for Large Action Spaces.

Shi Dong, Benjamin Van Roy

Published in: CoRR (2018)

Keyphrases

action space
state space
Markov decision processes
real valued
continuous state
reinforcement learning
state and action spaces
skill learning
stochastic processes
continuous action
control policies
action selection
Bayesian networks
Markov random field
Markov chain