Probabilistic Framework of Howard's Policy Iteration: BML Evaluation and Robust Convergence Analysis.

Yutian Wang, Yuan-Hua Ni, Zengqiang Chen, Ji-Feng Zhang
Published in: IEEE Trans. Autom. Control (2024)
Keyphrases
  • convergence analysis
  • probabilistic model
  • policy iteration
  • learning algorithm
  • Bayesian networks
  • evolutionary algorithm
  • Markov decision processes
  • dynamic programming
  • least squares
  • linear programming