Login / Signup
Ordinary Differential Equation Methods for Markov Decision Processes and Application to Kullback-Leibler Control Cost.
Ana Busic
Sean P. Meyn
Published in:
SIAM J. Control. Optim. (2018)
Keyphrases
</>
markov decision processes
average cost
kullback leibler
reinforcement learning
ordinary differential equations
neural network
state space
kullback leibler divergence