Login / Signup

Ordinary Differential Equation Methods for Markov Decision Processes and Application to Kullback-Leibler Control Cost.

Ana BusicSean P. Meyn
Published in: SIAM J. Control. Optim. (2018)
Keyphrases
  • markov decision processes
  • average cost
  • kullback leibler
  • reinforcement learning
  • ordinary differential equations
  • neural network
  • state space
  • kullback leibler divergence