Login / Signup
Ordinary Differential Equation Methods For Markov Decision Processes and Application to Kullback-Leibler Control Cost.
Ana Busic
Sean P. Meyn
Published in:
CoRR (2016)
Keyphrases
</>
markov decision processes
kullback leibler
state space
average cost
reinforcement learning
kullback leibler divergence
ordinary differential equations
machine learning
multi agent systems
graphical models
markov chain